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FOREWORD 


This is the eleventh issue of the Review concerned with the general area 
of guidance and counseling, and it is the second joint effort of the Ameri- 
can Educational Research Association and the American Personnel and 
Guidance Association. This issue is the first to give systematic recognition 
to the fact that the school counselor increasingly is seen as a member of a 
pupil personnel team, who works closely with school psychologist, school 
nurse, and school social worker. In setting the field of guidance and coun- 
seling within the pupil personnel framework, the authors distinguished be- 
tween the techniques of the related pupil personnel specialists and the ra- 
tionale for their employment, the administrative framework within which 
they work, and their preparation. As a consequence, this newer emphasis 
will be found primarily in Chapter I, “Foundations”; Chapter II, *Ele- 
mentary and Secondary School Programs"; and Chapter IV, “Selection, 
Preparation, and Professionalization of Specialists." 

With but two exceptions, the topics of this issue are similar to those of 
the April 1960 issue. The chapter on higher education programs, similar 
to that in the 1954. issue, makes visible the fact that organizations at this 
level usually differ from those at the elementary or the secondary school 
level. The chapter on foundations considers not only the philosophical 
basis for guidance, counseling, and personnel services but also the socio- 
logical and cultural framework. 

The guidance and counseling uses of appraisal data are considered in 
the chapter on the appraisal function; whereas research on the more tech- 
nical aspects of tests has been evaluated in the February 1962 issue, Educa- 
tional and Psychological Testing. Although the Committee attempted to 
define the content of the chapters to avoid needless duplication of material, 
some research articles were sufficiently broad in scope to merit discussion 
in more than one context. In such cases, the varying emphases seem to 
justify multiple inclusion. 

The Chairman wishes to express appreciation to the members of the 


Committee and to the additional contributors who wrote the various 
chapters. 


Buford Stefflre, Chairman, 
Joint Committee on 


Guidance, Counseling, and Personnel Services 
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CHAPTER I 


Foundations 


CARROLL H. MILLER 


For THE purpose of this chapter, the*term foundations will be used to 
mean the societal, cultural, and philosophical context from which are 
derived, or to which are related, purposes and practices of student person- 
nel and guidance. Since this is the first time that the term foundations in 
this broad sense has constituted a chapter topic in an issue of the Review 
devoted to the area of counseling and guidance, comparisons with findings 
of earlier issues are not possible. However, philosophical foundations were 
reviewed by Wilkins and Perlmutter in the April 1960 Review. The need 
for general principles and theoretical foundations reported by these authors 
is still evident. There is, however, some indication of a rising interest in 
existentialism, which will be considered in this chapter. 


Society and Culture 


College Students 


A recent volume prepared for the Society for the Psychological Study of 
Social Issues (Sanford, 1962) provided critical reviews of research and 
some new data on such topics as characteristics of student populations, 
student society and culture, the relation of student performance to educa- 
tional objectives, and the social context of higher education. Sprague 
(1960) reported recent research on motivation, values, attitudes, and 
campus cultures. Characteristics of freshmen classes in 128 colleges and 
universities that are members of the College Entrance Examination Board 
were summarized as a series of profiles (College Entrance Examination 
Board, 1963). A report by Hall and Craigie (1962) on students who 
borrowed through the National Defense Student Loan program included 
data on age, marital status, family income, major field of study, and 
occupational plans. A five-year listing of research projects supported by 
the Cooperative Research Program (U.S. Department of Health, Education, 
and Welfare, Office of Education, 1962) reflected limited interest in investi- 
gation of college society and culture. For example, of the 416 projects 
listed, only 12 were studies of college environments or of attitudes and 
values of college students. 

Additional References: Blaesser and Crookston (1960); Goldsen and 
others (1960) ; Heist (1960); Mueller (1960); Pace (1960) ; Riesman 


(1959) ; Wilk (1960). 
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School-Age Youth 


The report of the Commission on Guidance in American Schools (Wreun. 
1962) interpreted guidance in the American setting. This important 
material on social change and changes in the schools not only described 
the present situation in broad terms but also offered projections and 
recommendations. Coleman (1962) made the final report of 
investigation of adolescents in nine high schools, 
systems. The insulation of adolescent society from the adult world sug- 
gested by the findings poses a problem for which the investigator saw 
two solutions: (a) bring the adolescent back to the home or 
adolescent society as it is and use it for educational purposes, 
sive investigation was reported by Reiss and Rhodes (1959) 
almost 22,000 adolescents in the Nashville metropolit 
focus upon factors related to conformity 
goals, the immediately obvious implicatio 
cially if delinquent behavior is defined as 
was in the National Education Associ 
1959). But the Nashville study also helps to illuminate attitudes of adoles- 
cents in general toward their own and the larger American culture. 

Havighurst and others (1962), in Growing Up in River City, reported 
a longitudinal and intensive study of one community. Almost 500 pupils 
who in 1951 were in the sixth grade (chiefly in public schools) were 
followed through 1960. The study covers many facets: the community 
itself, abilities, achievement in school, adjustment, delinquency, marriage, 
the first two years in college or in early employment, and early adult status 
and mobility. The threads of inquiry are held together by two general 
questions: (a) How well did the boys and girls succeed in the tasks of 
growing up? (b) How well did the community provide for their needs? 
A very different kind of report was made in High School Students Speak 
Out (Mallery, 1962). This book is a statement, largely in the students’ own 
words as gathered by classroom visits and in interviews, of their experi- 
ences in high school. The sample included students in eight schools in 
three regions, Using still a different approach, Friedenberg (1959) con- 
sidered the adolescent’s problem of defining himself in the social conditions 
of contemporary American life. His analysis, made largely in a Sullivanian 


framework, led him to believe that adolescence as a developmental process 
is becoming obsolete. 


Additional References: Dugan (1960) ; 


a noteworthy 
with emphasis on status 


(b) accept 
An exten- 
involving 
an area, With its 
to or deviation from cultural 
ns relate to delinquency, espe- 
norm-violating behavior, as it 
ation project (Kvaraceus and Miller. 


Findley (1960). 
Values 


Most of the studies mentioned thus far at both 
levels gave some attention to the matter of values, 
more specifically so oriented. Kli: 
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school and college 


but some others were 
nger (1962) studied the moral values of 
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U.S. and foreign male students, including Arab Moslems, Chinese non- 
Christians. Indian Hindus, South American Roman Catholics, and Turkish 
Moslems. Cross-group value structures as well as those more characteristic 
of single groups were described. Warnath and Fordyce (1961) found that 
among five groups of college freshmen who had declared majors, the 
humanities group separated from others on an aesthetic-values scale, and 
the business group on a material-values scale. Maehr and Stake (1962) 
found that students who persisted in a,theological seminary scored lower 
on aesthetic values than those who withdrew prior to graduation. Prince 
(1960) found significant differences in value scores among high school 
students in public, private, and religious schools; but within each school 
differences between groups of freshmen and seniors were not important. 
Both groups resembled the teachers in the schools in their value patterns. 
A study of values of students in seven Catholic high schools indicated that 
attitudes of lower class parents and children were most favorable toward 
belief in God and toward belief in the dignity of each individual, while 
attitudes toward these beliefs were most unfavorable among upper class 
parents and children (Del Grande, 1959). 

Additional References: Ginzberg (1961) ; McArthur (1960) ; Simpson 
and Simpson (1960). 


The Culturally Deprived 


The present strong interest in inequality of opportunity afforded in 
varied community settings is clearly indicated by the response to reports 
such as Slums and Suburbs (Conant, 1961) and Social Dynamite (National 


Committee for Children and Youth, 1961). Although problems of ine- 


quality are related to the topic of foundations, the research content of 


available reports is often minimal. 

A review of 142 projects listed in Social Dynamite suggested that while 
some have been conceived as primarily research undertakings, many were 
simply reports of ongoing programs. Riessman (1962) said that the 
Higher Horizons program of New York City has rendered an important 
service in showing that the culturally deprived can be educated. Never- 
theless, he raised the problem of whether the results reflect a Hawthorne 
effect. However, in the report by Havighurst and others (1962) men- 
tioned earlier, the authors commented that although there were several 
types of disadvantage, that of low socioeconomic status was foremost. 
The research difficulties in this area are manifold; even the very term 
culturally deprived is not easily defined. Any attempt at a summary of 
findings would be inappropriate at this time, especially since many of the 
projects are still in progress. 

Additional References: Impellizzeri (1960) ; Krugman and Impellizzeri 
(1960). 
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Philosophical Trends 


One of the most noticeable characteristics of the literature related to 
foundations is a lack of philosophical criticism. However, a few contribu- 
tions are well worth noting. The social thinking of five pioneers in the 
field was interpreted by Rockwell and Rothney (1961), Frank Parsons 
was presented as a utopian social reformer and Jessie B. Davis as an 
advocate of the social gospel; Anpa Y. Reed and Eli Weav 
fied with social Darwinism: and Davis S. Hill was seen as more concerned 
than the others with applying scientific research to the study of the indi- 
vidual and to social problems in general. 

Noting that traditional philosophy has contributed to guidance certain 
unanalyzed assumptions and unquestioned beliefs, Gowin (1961) pointed 
out that attempts to formulate a philosophy of personnel work have often 
resulted in an eclecticism of realism, idealism, rationalism, and experimen- 
talism. Most counselors, he noted, do not appear to be familiar with exis- 
tentialism and logical positivism. Existentialists tend to reject the social 
order, while positivists ignore rather than reject it. Gowin speculated about 
the kinds of answers these two orientations might give the counselor, 

Wise (1962) found in existentialis 


er were identi- 


epts of importance 
at sees the world as the 
hich the person exists and (b) 
from the concept of responsibility. 
out the relation of guidance and 
expect guidance to he on the side 


) actually, it appears to support the 
“organization man” principle. 


trying to find a wa 
emerging that mi 
be that self which one truly is. 


€ is the paper in which he 


disci 1 | * reconciling logica] Positivism with the “sub- 
jectively oriented existential thinking which seemed to fit so well 


à DERI A : ned 
with my therapeutic experience." Or, stated differently, it is the question. 
"Persons or science? 


Additional References: Lowe (1959) ; Pervin (1960) ; Shoben (1961) 
Smith (1959, 1961). 


Concepts and Purposes 


Principles and assumptions of guidance and personnel practice and the 
concepts and purposes relevant to them are, to say the least, greatly 
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varied. 
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There seems to be little discernible movement toward any generally 
accepted synthesis. Although concepts of counseling and program organi- 
zation are relevant, they are covered in other chapters. 


In the Schools 


In a statement by Bowles (1959), guidance was conceived primarily as 
educational guidance involving appraisal of the individual and educational 
placement. Cox (1961) noted three new emphases in vocational guidance: 
(a) an internal rather than an external criterion of success, (b) the work 
life as a developmental process, and (c) the relation of vocational choice 
and personality. Weitz (1961) interpreted guidance as behavioral change 
and suggested a model, which he applied to testing and counseling. Mathew- 
son (1961) suggested four dimensions in guidance practice: (a) classroom- 
centered-specialized, (b) self-integrative-prescriptive, (c) personal-social, 
and (d) subjective-objective. In a somewhat similar manner, Glanz (1961) 
identified four patterns: (a) centralized specialism, (b) decentralized 
generalism, (c) curricular counseling and guidance (group guidance), 
and (d) human relations and group work (mental health). 

In a historical study of guidance in Ohio since 1900, Hummel (1960) 
found one of the trends to be an emphasis on self-realization and freedom 
in choice making. Ginzberg (1960) questioned three assumptions com- 
monly made: (a) guidance should deal with the whole person; (b) the 
counselor has a responsibility for helping the individual to understand 
himself; and (c) guidance is a specific field, and the counselor an expert. 
No evidence is available by which to judge the extent to which schools 
generally have attempted to clarify and state their purposes, but a study in 
Ohio found that only 32 percent of 958 schools surveyed indicated that 
they had formulated statements of guidance philosophy—statements which 
varied considerably in orientation (Peters, 1961). 

Additional References: Barry and Wolf (1962) : Council of Chief State 
School Officers (1960) ; Cribbin (1961) ; Hilton (1959). 


In Colleges and Universities 


The past three years have yielded few critical reconsiderations of pur- 
poses and policy of student personnel work. Wise (1961) suggested that 
for the past 30 years the student personnel *movement" has neglected both 
historical roots and new developments so that a cult has arisen “isolated 
al influences in American education, particularly 
ller (1961) saw student personnel work as “an 
ts roots in psychology and sociology.” In one 
t of teaching, with funclions related to the 
differentiating the specific objectives of 
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fron: many of the centr 
higher education.” Mue 
applied social science with i 
sense, it is a special aspec 
objectives of higher education but 


REVIEW or EDUCATIONAL RESEARCH Vol. XXXIII, No. 2 


its own functions, which center about the integration of campus life. In 
his latest book, Williamson (1961) wrote of student personnel work “as a 
tangible expression in the program of a basic philosophy. * He described 
five aspects of this point of view: (a) “holism,” or concern with all aspects 
of development; (b) recognition of “the unique individuality of each 
individual”; (c) belief that in the education of some students, classroom 
teaching is not enough; (d) use of methods and relationships in an educa- 
tive rather than an authoritarian vay; and (e) utilization of new knowl- 
edge from many fields. The chief emphasis of Williamsoa's book, which was 


concerned with a philosophy of administration, actually is beyond the 
scope of this chapter. 


Summary 


Research in the area of foundations still lacks integration in terms of 
any large concept or set of concepts, Clearly, one immediate need is a defi- 
nition that will set limits on the area of foundations. Within the definition 
arbitrarily adopted in this chapter, one focus of rescarch that seems to be 
emerging centers about peer society and culture as they exist in schools and 
colleges. This focus is clearer now at the college than at the secondary 
school level. 

A conspicuous feature of the literature concerned with the college level 
has been the appearance of broad syntheses of research in certain areas. 
At the secondary school level, reports of specific problems by individual 
investigators have been more common. Statements 


of concepts of guidance 
in schools and challenges of these have been so varied that they defy an 
interpretation of trends; 


they seem to reflect a. welter of current opinion 

and practice, 
On the level of philosophic inquiry, 
dominant than in the past, and some trend 
ism is appearing. Perhaps this reflects an influence from emphasis on 


therapeutic counseling, or it may reflect existential elements in the broad 
context of contemporary culture. 


eclecticism appears to be less 
toward an interest in existential- 
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CHAPTER II 


Elementary and Secondary School Programs 


HAROLD J. REED and BUFORD STEFFLRE 


i 
Ix THE April 1960 Review, the last issue on the organization and adminis- 
tration of guidance services, Dugan noted the paucity of research on 
guidance policy, staff qualifications, budgets, facilities, and counselor 
duties. The past three years were no exception. 

Some of the areas— facilities, budgets, 
—which were reported then as minimal activities have not changed. 
Emphasis on some elements, such as identification and development. of 
individual potential, elementary guidance services, and, 
duties and functions of counselors, is greater now than during the period 
studied by Dugan. Some areas have appeared in the literature for the first 
time during the past three years. Data-processing techniques and the 


increasing reference to pupil personnel services instead of guidance services 
are examples. 


À greater number of areas have been in 
literature as guidance services hav 


referrals, and cumulative records 


especially, the 


vestigated and reported in recent 

e expanded. However, the specific con- 
tributions in organization and administration are disproportionately low. 
Of 380 projects initiated under the Cooperative Research Program (U.S. 
Department of Health, Education, and Welfare, Office of Education, 1962) 
during the five-year period from 1956 to 1961, only 22 were classified under 
counseling and guidance, and only one title suggested a research project in 
organization and administration of guidance services, 

Whether one examines organization and administration practices for 
validity or as criteri ome predictive methodology, one 
finds that little research is being done. However, the picture may be only a 
eans relationship may be operating; and 
lency may, in fact, reflect a 
cisions on the basis of research 
personnel selection, and ot 
program. If, on the other 
that organiz 
searched an 
“it-is-agreed” basis, 


her prerequisite 
hand, it is found 
ation, job functions, and pro- 
d that administrative decisions 
then there is some c. 


1 ause for 
alarm. 


Organization and Administration of Guidance Services 


Guidance services as measured by imp 
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counselor standards have been aided by the National Defense Education 
Act (NDEA) program since 1958 (Crummel, 1961). The number of 
counselors has increased by more than 50 percent in two vears. State 
department guidance workers have increased by 150 percent. 

Current administrative needs may be assessed by noting the use of NDEA 

funds. The distribution of the 185 district and county NDEA projects 
reported in California by the California State Department of Education 
(1902a) included (a) 150 to increase professional staff time. (b) 138 to 
increase clerical time, and (c) 75 to provide parent conferences. 
i Another research study reported by the California State Department of 
Education (1962d) showed the influence of professional credentialing on 
the quality of counseling. Since 1956, California has required a pupil 
personnel services credential for those performing counseling services as 
p maot job function. A survey of 81 percent of the schools revealed that 
55 percent of all counseling is performed by the 42 percent of counselors 
who are credentialed. Of those who are not credentialed, 62 percent are 
taking courses. Students in medium-sized schools are more apt to receive 
counseling from well-trained and full-time counselors than those in small 
or large schools. 


Types of Counselor Assignments 


relating to the assignment of counselors has 
appeared, and much more is needed. When responses of students on coun- 
seling wanted and the values of help received were classified by organiza- 
tion of services—that is, full-time counseling, part-time counseling, and 
counseling by the teacher with no released time—it was found that students 
in teacher-counseled classes asked for more help than those in the other 
two organizational systems, but counselees of full-time counselors reported 
more help received (Thomson, 1960). Loughary (1959) examined some 
organizational systems and concluded that in large schools where a choice 
is possible secondary school administrators do not agree on the relative 
merits of full-time counselors as opposed to teacher-counselors, The ration- 
ale for this conclusion, according to Loughary, is that specialists might 
lose sight of major educational objectives. Advocates of the full-time coun- 
selor system argue that specialized skills are necessary. 

Without attempting to compare the relative merits of full-time versus 
part-time counseling, the California State Department of Education 
(1962c) found that in 645 high schools surveyed, 21 percent of the coun- 
duled full time and 59 percent, half time or more. 


Some needed research 


selors were sche 


Use of Ratios 


Ratio is the most frequent technique used for assessing counseling serv- 
ices, but an erroneous assumption is made if ratio is not computed on 
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the basis of credentialed or qualified counselors. The California State 
Department of Education (1962b) discovered that the ratio of 1 full-time 
counselor to 412 secondary students in 81 percent of the state's high schools 
changed to 1 to 745 when only credentialed or qualified counse | > 
used to compute ratios. Districts under 500 and over 15,000 had a ratio of 
1 qualified counselor to 1,300 students, while other districts averaged l to 
600. Ratios can also be misleading if, for example, one district assigns 
attendance services to an attendance worker, while 
ance to the counselor. Discipline may be handle 
school and by the assistant princip 
ratios have little or no meaning in these and other similar situations. 

The Wrenn (1962) report attempted to consider 
factors by stating that a counseling progr 
number of qualified counselors is p 


lors were 


another assigns attend- 
d by the counselor in one 
al in another. Obviously, comparative 


misleading ratio 
am exists only when a sufficient 
esent at a school t 
counseling needs of all students, According to 
selor Aeweniahinn (ASCA) recommendation, 


includes counseling students, e 


to take care of the 
the American School Coun- 
program 


T studying character- 


if the counseling 
onsulting with teachers 
istics of student population, and coordin 


ating services in the school and 
with the community, then the ratio in high school should be l to 300 and, 
in elementary school, 1 to 450. 


Use of Data-Processing Equipment 


A significant contribution to the administration of guidance 
including the utilization of profession 

increase in the use of dat 
and using essenti 


services, 
al personnel, has been the rapid 
Gathering, recording, 
en considered a high- 
emonstrating the effec- 
r B at an accelerating rate (Kinling, 1960). 
A pilot data-processing center has been in operation for two years in Cali- 
fornia, following a study that suggested that processing data was more 
efficient in 10 districts using machines than in 10 districts using manual 
methods (Grossman, 1962). Machines established their superiority in 
processing pupil personnel data and utilizing counselor time in clerical 
duties—29 percent for data Processing and 39 percent for schools using 
manual procedures. It was tentatively concluded that financial economy 
has been overestimated and that human economy was the real gain. 


Special Services 


surveyed large districts and found 
» Psychological, mental health, and 
or the exceptional child, However, 


indicated that 7.5 percent of the 
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pupils could benefit from mental health services and less than 2 percent 
were receiving help. Jampolsky (1961), as a psychiatric consultant, de- 
scribed a successful elementary school program for disturbed children 
involving a special class for pupils, group therapy for parents, and a 
child study program for teachers. 


Timing in Introducing Guidance Services 
1 


Another problem that requires the tance of related professional con- 
tributions is determining the appropriate time to introduce certain guidance 
E ies tet and career guidance programs, for example, have 
ead Kien uced during recent years at grade levels earlier than the twelfth 
g a e, in which they had been traditional (Sinick and Hoppock, 1960). 
bic T eia been partially responsible: (a) the increasing emphasis 
" : ication ol the talented at a level where there is more talent 
tn proportion to sige of age greep in the total population and (b) the 

l i j services A8 different 


A o E of counseling and psychologica 
Es : performed by the teacher. 
Noe some evidence that research in developmental psychology is 
A g program decisions, but not sufficiently. Progress reports on the 
vocational maturity study of ninth grade boys have demonstrated the value 
of submitting theories and hypotheses to experimentation (Super, 1960; 
Super and Overstreet, 1960). The authors concluded that ninth grade boys 
were ready to explore, but not to make decisions—a finding that suggests 
the objectives and content of guidance courses. Gribbons (1960) found 
that youth at intermediate and junior high school age can benefit from 
information and opportunities to synthesize their understanding and ap- 
plication of concepts of abilities, values, and interests to their own choices 
. The trend toward introducing educational and career guidance programs 
into the elementary and junior high schools has provided another reason 
to continue discussing the relative contributions of the teacher and special- 
ist to the guidance program. Shimberg (1959) found that when teachers 
were provided the Educational Testing Service manual and guide, You: 
Today and Tomorrow, 108 eighth grade students learned essential elements 
in career planning. 


Teacher Versus Specialist Participation 


Teacher attitudes toward guidance and participation in guidance services 
are positively and significantly related, according to Stewart (1961). Ran- 
domly selected teachers in the state of Washington with high and low at- 
titudes toward guidance had significantly different scores on the Minnesota 
Teacher Attitude Inventory. 

Some authorities have contended tha 
perform that is uniquely different from that of the tea 


t the specialist has a function to 
cher. Hatch and 
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Costar (1961) pointed out that guidance and 
onymous. Prevention programs require the 
cording to Eckerson and Smith (1962a, c). 
surveyed junior high schools and noted th 
guidance functions in core classes 
but they felt that the desirable tren 
guidance personnel. 

Until recently, most guid 
performed with a clinical 
Wrenn (1962) reported the 
selors, indicating that durin 


good teaching are not syn- 
services of the specialist. ac- 
Cottingham and Hopke (1961) 
at teachers are responsible for 
and in other group guid 


ance programs, 
d is toward the utilizat 


ion of specialized 


ance services at the element 
emphasis by school psychologists. However, 
results of a survey of elementary school coun- 
g the next 20 years 45 percent expected their 
role to be one of coordinating many counseling duties and helping parents 
and teachers, while only 23 percent predicted a clinical emphasis, and 
32 percent, a combination of the two. Wrenn also reported a similar trend 
in a Bakersfield, California, s o-year counseling program, 
the teacher’s perception of t e in psychological services 
decreased from 70 to 20 


ary level were 


percent. 


Roles and Functions of Pupil Personnel Workers 


The division of labor so ch 
acterizes education, 
following have been pupil personne] Services, and 
specialists have been 


leir respective functions: at- 
tendance, guidance, health, 


I Psychological, and Social work (Fusco, 1961; 
Council of Chief State School Officers, 1960). i 


Health Services 


lth services is criti 
: iub. s critical, 
discussed the implica š cal 


t : 
of 36,000 ee 1on of school health services 


i erage number of absenc 
by sex and place of residence, 
Neilson (1960) reported a 


- Only 31 


guidance-hea]th Services 
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The school nurse is apparently the responsible health worker (Klein. 
1959). Klein found that 95 percent of pupils with health problems were 
referred to the nurse as were a large number of pupils with the following 
problems: 41 percent with home problems, 75 percent with mental health 
problems, 91 percent with social problems, 36 percent with moral and 
spiritual problems, and 17 percent with vocational problems. 


e 


Psychological Services 


b Seow in psychological services during the last 11 years as measured 
P, membership in the Division of School Psychologists of the American 
HA Association has been a remarkable 800 percent. Higgins 
ae A rather utopian proposal for a hypothetical medium-sized 
idivlasiat ai a psychological service staff with a ratio of 1 psy- 
gist for every 900 pupils. Trachtman (1961) pointed out that Chi- 
cago, which now has 90 such specialists, would need 450; that New York 
City with 100, would need 1.000; and that if every qualified psychologist 
t the country went to work for the schools, the proposed ratio would 
still not be met. 


Self-Perceptions of Counselors 


The self-perceptions of counselors may be too psychologically oriented, 
if the inference from the ASCA study reported by Wrenn (1962) is valid. 
It was found that all of the 242 secondary counselors and 71 percent of 
138 elementary counselors who were ASCA members listed counseling 
students as a basic activity; and between 30 and 45 percent listed con- 
ferences with parents and teachers, test administration, and group guidance. 
However, only 8 percent listed involvement in curriculum development as 
a basic activity; and 12 percent, collection and dissemination of vocational 
information. The same sample of secondary counselors made recommen- 
dations for future counselor preparation. Fifty-four percent selected psy- 
chology; 23 percent, the next subject most frequently mentioned; and 11 
percent, vocational information. One interpretation of these data would 
suggest that at least this sample of counselors is more psychologically 
oriented than, for instance, vocationally oriented. This is significant since 


vocational orientation is frequently listed as one of the most important 


counselor functions. 
Nomenclature of Guidance Services 
The nomenclature of guidance services reflects the state of conflict in 


the profession. A guidance worker may be called a counselor in one setting, 
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d a counselor may be called a psychologist in another. Eckerson and 

Smith (1962a, b, c) discovered that an elementary guidance consultant 
T + oe . 

may be variously titled counselor, visiting teacher, adjustment te 


acher, or 
counseling teacher. 


Role and Function of the Counselor 


The school counselor as a 
personnel worker. If the size 
the full complement of pupil | 
the functions of the other team members, 

Two approaches have been taken to a study of the counsclor’s 
function: (a) The most common approach is the 
of those directly and indirectly related to coun 
the results are summarized. 'These groups include the counselors. them- 
selves, students, teachers, parents, counselor trainers, and 
Another approach has been to relat 
tives, to the needs of children and youth, and to the unique 
of the community, Program decisions are then m 
assignment of duties to each 
and interests, Although the latte 
by the few available references, 
tions to three school « 
priority. Of the other two—socialization and 
ual—the latter is more important, h 
neglected. The counselor is c 
one else is. He gives inform ains from determining how it 
should be used. He appraises the student, but does not judge him. He 
works with disciplinary problems, but does not punish; helps plan, but 
does not make choices, 

The approach to a study of counselor role 
a survey of practices jg currently the more p 
the greater number. of references, The four major functions performed 
by the school counselor according to Wrenn (1962) are ( 
dents; (b) consult with teachers, parents. and 
and interpret changing 
counseling services, Malcolm (1961 
and functions in such areas as counseling, consulting, and research will 
result in more precise terminol i 
Education (1962b) found th 
counseling students, : 
sulting with parents, 
and liaison with the c 

The ASCA study ri 
time counselors, 35 percent 


or mor i 
students; 30 percent Spent two periods or more consulti e counseling 
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title and as a functionary is the chief pupil 
and policy of the district does not permit 
»ersonnel workers, the counselor is 


assigned 


role and 
one in which the opinions 
seling are surveyed, and 


others. (b) 
€ staff personnel to educational objec- 
characteristics 
ade, followed by an 
member of the staff according to skills 
r approach is not a common one, judging 
Sorenson (1961) related counselor func- 


objectives, with the intellectual one given highest 


development of the individ- 
arder to achieve, and more likely to be 
ommitted to the indiv 


idual in a way that no 
ation, but refr 


] State Department of 
at counselors divided their time as follows: 
seling duties, 38 Percent: con- 
Consulting with teachers, 10 percent; 
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and parents; and 70 percent spent one or two periods doing group work 
with students. . 

One function that to date appears to be unacceptable to counselors is 
handling discipline, a function which one-half of the administrators sam- 
pled by Cortale (1961) felt that they should perform: the administrators 
also felt that one-third of the teachers would accept the counselors more 
if they did handle discipline. Opstad (1960) apparently recommended 
that counselors accept discipline referrals, but with certain restrictions 
such as not acting ‘in a punitive, threatening, or investigating role. The 
Bakersfield, California, study reported by Wrenn (1962) found that teach- 
ers did not accept the counselors’ heavy emphasis on clinical counseling 
and that they expected counselors to assume a role in discipline. 

The opinions of students were surveyed by Heilfron (1960) and by 
Gibson (1962). Heilfron found that students perceived counselors as work- 
ing primarily with abnormal cases. The areas of deficiency identified by 
Gibson could be interpreted as functions according to students’ percep- 
tions, Bergstein and Grant (1961) surveyed 200 parents distributed over 
four grade levels and found that they perceived the counselor favorably, 
more in educational and vocational areas than in personal-social problems. 

A significant problem in guidance administration is the degree of agree- 
ment on the counselor’s role between the counselor and his administrators. 
A Q-sort by Schmidt (1962) showed essential agreement, but counselors 
and administrators may have been satisfied about different things. Chenault 
and Seegars (1962) compared results on Leary’s Interpersonal Check 
List and concluded that the significant factor in differences between admin- 
istrators and counselors may not have been expectations of each other's 
role but expectations of each other as persons. 

Two current studies will have a great impact on role and function defini- 
tion. The standards and role studies of the Association for Counselor 
Education and Supervision and American School Counselor Association 
(1962) will attempt to define counselor role and function by surveying the 
opinions and practices of counselors and counselor trainers. The assump- 
tion seems to be that those performing the counseling service should set 
standards and develop criteria. Those who would define the counselor's 
role by determining the role that guidance or pupil personnel services 
should play in the total educational program would not necessarily agree 
that the role of the counselor is what the counselor does. Practically all 
authorities and counselor surveys, for example, list vocational placement 
as a counselor function (Wrenn, 1962), but the number of placement 
services indicates that the service has not become policy in most schools. 


Issues Relating to Coordination of 
Pupil Personnel Services 


The trend toward organizing pupil personnel workers as a team will 
apparently continue but not without problems. Attempts to view the total 
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i am i integrative fashion were made by Johnson. 
upil personnel program in an integrative fas y 
Steffre, and Edelfelt (1961) and by Rosecrance and Hayden (1960). Doth 
, 
texts synthesized previous research on such matters as 


incidence of special- 
ists, their duties, training, and suggested ratios, 


Dt 


Roles and functions of the various team members show much duplication 


when definitions by each group are compared. Job titles and job functions 
become confused in research, discussions, and association activities, If 
counselors counsel. what docs a social worker do when working with an 
individual? 

The Commission on Guidance in American Schools (Wre 
ommended elimination of the term 
point of view. McCull 


nn. 1962) rec- 


r, disa- 
services, but only in a 
team of workers repre- 
l that it would be futile to 
In fact, counselors could not 
ing criteria are met: counseling differ- 
entiated from other social service occupations, standards accepted for se- 


8g, means developed for accrediting training institutions, 
certification standards established, counseling services improved, and a 
code of ethics enforced. 


A critical problem 
the position of pupil 


ment structure of the school, Knap 
concern for the s 


the tendency for counselors to w 
which h 


greed. He proposed 
school setting; wher 
senting 


ganization and manage- 
l) expressed some 
ance workers and about 
als rather than teachers. 


observed from surveying 
variations existed in se 
nd job titles, T 


i ib dM ses and counselors were line- 
Tepon le to the principals; and, į e nurses and dental 
hygienists Ben the principal and the staff 
specialist. 


nne] subdisciplines supple- 
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ment their own efforts at role definition by working together toward a 
common end and as a total group join with other educational departments 
and administrators in determining the unique contribution of pupil per- 
sonnel services to the total cducational program. 
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CHAPTER III 


Higher Education Programs 


ESTHER LLOYD-JONES and MARGARET RUTII SMITH 


Once AGAIN it must be reported that research on student personnel pro- 
grams in higher education in the past three years has been characterized 
by paucity. The little that has been done will be reported under the follow- 
ing headings: four-year colleges, extended education, and junior colleges. 
What might be described as sage literature—thoughtful writing by those 
of broad and long experience—will be briefly summarized. 

In the October 1960 issue of the Review, Feder and Siller concluded that 
there is little evidence concerning the validity of certain stereotypes of 
organization and function that have been adopted in student personnel 
practices. They recommended more research involving use of controlled 
experimentation as well as study of similar institutions with some varia- 
tion of program in order that the assumptions under which present pro- 
grams are operating might be validated. Although the tendency to lean 
heavily on surveys of practices may reveal normative practice and, in 
effect, establish least common denominators, it may stifle organizational 
and operational experimentation. 

Although the guidance movement is entering into its second half-century, 
Rothney and Farwell reported in the April 1960 Review that there is 
general recognition of the need for evaluation of guidance services. How- 
ever, there is little evidence that this need is being met. Guidance services- 
like many other services in education, are still offered largely on the basis 
of hope and faith. 

After reviewing research activities pertaining to students and student 
personnel programs being conducted by 13 agencies and centers for the 
study of higher education, Robinson and Brown (1961) reported 77 studies 
in progress or recently concluded. All of these have to do with such matters 
as student characteristics. attendance, retention, and withdrawal. as well 
as with aspects of student personnel programs. No studies were reported 
that looked at student personnel programs as a whole. 


Four-Year Colleges 


serve mention here. Two 
titution’s student 
ral arts colleges; 
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Five studies of student personnel programs de 
doctoral students have attempted to evaluate a single ins 
personnel services. A third study surveyed a group of libe 
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another, a group of small colleges. The fifth tackled the problem of stud y- 
ing student personnel work in the context of a college that is 
Christian. 

Beckers (1960) attempted to evaluate the student personnel program 
on the undergraduate level at the Auburn University School of Education. 
He took five basic steps to achieve this purpose: (a) The role of student 
personnel programs in teacher education was defined 
from which appropriate criteria evere derived for 
sonnel policies and practices, 
personnel program in the Scho 
lyzed. (c) Student and faculty 
were obtained. (d) 


pr fessedly 


as a background 
appraising student. per- 
(b) The major elements of the student 
ol of Education were described and ana- 
reactions to the student personnel services 
The student personnel program was evaluated in terms 
of the criteria derived in the initial step. (e) Recommendations were made 
on the basis of the findings and conclusions of the study. Data for the 
investigation were obtained from observations 
program, from interviews with student 
questionnaires completed by 
tinent faculty reports. 


Beckers concluded the following: ( 
accepted in all or most respects a poi 
values for the institution as a w 


of the student personnel 
personnel staff members, from 


student and faculty members, and from per- 


a) The faculty respondents, in theory. 
nt of view consistent with 


hole. This point of view, how 
student personnel program, ( 
faculty respondents about wh 
nel program was and wh 


democratic 
ever, had not 
b) There was 
at the over-all 
at its relationship 
There was general 


ents with respect to 
al student. personnel service areas. (d) The 


i rOgr: as well supported by administrative leader- 
ship, but was lacking in ov (e) The availability of 


to some faculty members from two depart- 
ments. (f) The student pers ly were durent in 
group that generally had a more 
el program was better acquainted 
was the student group. 
gram at Cl ji i ] 
(1960), who attempted not p e Collage ed Stüdied hy Brantley 
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the development of the 
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n Contemporari]y. 


: : and in which the environ- 

ments, interests, and problems of the Students were known, an adequate 
od a 
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and evaluation could be provided; and information useful to similar insti- 
tutions could be made available. 

Two of his recommendations were (a) that a student personnel worker 
should understand before his employment any differences that exist be- 
tween the philosophy of the college and his own educational philosophy 
and (b) that the administration of the college should clarify for the faculty 
and the staff the extent to which each personnel service is essential to the 
total program of the institution. 

" or forms were sent by Arbuckle and Kauffman (1959) to 228 
— aa ieee coal arts colleges.” OF the total number, 186, or 
i (e) jene iN areas were studied: (a) orientation, (b) housing 
hine e: 1 services, (d) financial aid programs, (e) counseling 
The sc ids m bp and personal) , and (f) self-government activities. 
bp ene rom this survey indicates that the modern liberal arts college 

J ans unaware of the role of student personnel services in higher 
education. All the colleges offer some form of most of these services. The 
colleges reported that although they give most attention to housing and 
health, they also are quite heavily committed to vocational services. 
Arbuckle and Kauffman concluded that (a) these colleges are not so anti- 
vocational as is sometimes assumed, and (b) counseling is the service 
which is least effectively provided and concerning which a need for 
improvement is most expressed. 

In a sampling of 16 small liberal arts colleges in the North Central 
Association area, Scott (1960) compared personnel services, principles, 
and practices. He found that although the principles of student personnel 
work were comparable in small and large institutions, the method of im- 
plementing the principles varied widely. He suggested that small colleges 
think seriously about the methods that they use and about the ways in which 
the services are coordinated with each other and with other areas of the 
institutions. It should not be assumed that the system of services for 
students will automatically be well coordinated just because a college is 


small. 


A study by Riggs (1962) of personnel work in the context of a Christian 


college assumed that student personnel work—like work taught in class- 
rooms—must be intimately related to clearly defined institutional goals. 
From the statement of institutional goals of De Pauw University, the per- 
in weekly meetings and with the help of nationally distin- 
guished thinkers, developed a specific set of goals for the University 
student personnel program. The basic needs of the kinds of students drawn 
to and accepted by such a university were carefully studied; and coun- 
seling, discipline, perspective, and administration were viewed against 
this Lackground. This study is cited, not because its research design is 
clear and its methods rigorous, but rather because its approach moves in 
the direction advocated by McConnell (1960), Mueller (1961). Brunson 


(1959), and Williamson (1961). 


sonnel staff, 
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Extended Education 


Simonaitus (1960) undertook a study to determine practices and de- 
sirable provisions of the student personnel programs in evening colleges 
and to examine their influences on enrollment. His findings were based 
on data from 79 colleges who returned usable questionnaires from among 
the 120 colleges to whom they were sent. He concluded that most of the 
provisions considered essential in a personnel program for day college 
students were also considered essential provisions in a personnel program 
for evening college students. 

Motto (1961) reported from the Flint Extension Center of the Univer- 
sity of Michigan a pilot program initiated in 1958 that was oriented 
toward analyzing student needs, defining the counselor's role, and integrat- 
ing advisement practices with those of administration. This program asked 
the following questions: (a) What needs exist for advisement in a gradu- 
ate study center whose total yearly population exceeds 2.000? (b) What 
perceptions of advisement are held by graduate students? (e) What 
student needs, if any, could be met most effectively by a resident advise- 
ment program? (d) What range of problems would be presented? (e) 
What competencies would be required of an adviser who was to service 
this population? The answers to these questions were sought as a basis 
for developing a rationale for advisement services in an extension and 
graduate study center. The results showed the following advisement service 
requests: (a) educational, 66 percent; (b) vocational, 18 percent; (c) 
therapeutic, 9 percent; and (d) personal, 7 percent. From these results. 
Motto drew the conclusion that since one-third of those requesting con- 
sideration wanted services other than educational counseling, a counselor 
to be employed might well have not only a doctorate in clinical or coun- 
seling psychology but also such other prerequisites as a practicum train- 


ing in counseling, public school experience in education, and training in 
a medical setting. 


Junior Colleges 


Not too surprisingly, there has been a spate of studies on the junior 
college level, some of which offer f 


For the Commission on Stud resh insights for research on other levels. 
on Student Personnel of the American Association 


of Junior Colleges, McDaniel (1962) attempted to identify essential stu- 
dent personnel practices in junior colleges, The following salient points 
were set forth: (a) The purposes of the college are principal determiners 
of other forces. (b) A student personnel program is an tiad pattern 
of services and activities that makes planned use of D i of 
restrictive controls in order to assist each student to make full usn idi his 
college education in progressing toward the &oals of the college. (c) 
Essential information about students must be determined eme ana- 
> , 
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lyzed, and used. (d) Systematic evaluation of tests used, of grading prac- 
tices, of class admission standards, and of all college personnel policies 
is the only protection against mistakes. 

Medsker (1960) found in a survey of 76 junior colleges in California 
that administration and boards of control had not put student personnel 
services on a par with instruction and that all too often insufficient budget 
appropriation had been made to insure a full range of such services. The 
following elements of strength were noted: (a) Administrators recognize 
the responsibility of a two-year college tor an effective student personnel 
program. (b) Somé type and level of counseling program exists in each 
college. (c) Most colleges have a well-developed student activity program. 
(d) A good system of academic records is maintained in all colleges. The 
following elements of weakness were cited: (a) Many institutions lack 
poliey formation, planning, and professional direction of the program. 
(b) The counseling program in many institutions is inadequate. (c) Little 
research is conducted to enable the colleges to obtain facts about their 
students. (d) Two-year colleges make only a limited effort to evaluate 
their personnel programs. 

Glanz, Hayes, and Penney (1959) described the experimental use of a 
freshman psychology course as furnishing what they believed to be a 
successful basis of a student personnel program in Boston University 
Junior College. Although their report made no mention of objective evalua- 
tion, Gustad (1959), who disagreed with the experimenters, maintained 
that the academic program should emphasize one set of goals and that the 
personnel program should accentuate another although obviously related 
set. On the other hand, Williamson (1960) agreed with Glanz in stressing 
that personnel workers should be concerned more, as are the faculty, with 
the process of education and learning than with the activities underlying 
administration of services. 

Starr’s (1960) survey of guidance practices in selected junior colleges 
in the Northwest revealed that practices stem from four basic concepts: 
(a) The junior colleges are by nature and, in some instances, by law, 
preprofessional, vocational, special-education, and general-education insti- 
tutions dealing with students with varying degrees of maturity, academic 
development, ambition, and attitudes. (b) Each junior college must be 
free to develop its program to meet its particular needs. (c) Counseling 
is important in all phases of the student’s life. (d) Counseling and guid- 
ance are essential in the junior college. 

The following characteristics of programs reported by Starr were noted: 
(a) Personal-social guidance has not been extensively developed. (b) Lack 
ed counselors is ù major weakness. (c) Academic-voca- 
tional. guidance receives most attention. (d) Extensive use of local com- 
munity agencies is evident. (e) Extensive use of faculty members often 
untrained in guidance seems to be customary. (f) Testing programs are 
offered more consistently than are other services. (g) Little, if any, re- 


of available train 


search or evaluation is evident. 
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A comparative study of guidance programs in private € Hey Das 
colleges was made by Rice (1959). Fifteen private New 1 ng anie peor 
public California junior colleges were studied by means of (al fue 
and analysis of publications; (b) questionnaires sent to presidents, c SRI 
counselors, and teacher-counselors; (c) visits to campuses for intensive 
investigations; and (d) personal interviews with officials. He found that 
guidance aims differed in terms of quality and in relative degree of ac- 
ceptance. Private junior colleges emphasized the aims of mental health 
and personal guidance, while punlic junior colleges stressed the aims of 
vocational and general guidance. Guidance personnel also differed in many 
professional respects. For example, public junior college personnel were 
found to be chosen on the basis of standardized and professionally valid 
criteria. Compared with private junior college personnel. they had greater 
experience and training and enjoyed higher professional status and com- 
pensatory benefits, Private junior colleges kept more extensive student 
files of superior over-all quality. Although both private and public colleges 
offered standard guidance services, neither seemed to have psychiatric 
help available. No mention was made of the relationship of guid 
services to institutional objectives, although 
leges emphasized academic guidance. 

Klitzke (1960) pointed to the need for research at the junior college 
level and indicated important questions dealing w 
students and parents, staff, and lay public that need to be answered. He 


emphasized that student personnel workers need both more research op- 
portunities and more research capabilities. 


ance 
both public and private col- 


ith individuals, groups, 


Issues Underlying Research Needs 


Student personnel work cannot contribute effectively unless the over- 
riding goals and value patterns of the institutions are clearly defined and 


attainable (McConnell, 1960). Educational goals, however, are usually 
stated in such general form that the 


y do not provide a unifying rationale 
as a concrete and dependable guide for the institution’s activities. 

Brunson (1959) made a thoughtful study of the concept of organic 
administration and of the need for integrating guidance with the rest of 
the college educational program. She examined by means of case studies 
the processes used by a variety of liberal arts colleges that were attempt- 
ing to achieve a higher degree of integration of the separate parts of their 
education programs. . 

The structure of the program cannot be divorced w 
policy (Williamson, 1961). However, the policy that does and should 
determine the nature of an enterprise is often so all-pervasive and long- 
standing that it is frequently taken for granted and sometimes not even 
understood in detail. To a great extent, education that has been content 
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to organize its programs has failed to study systematically the outcome or 
products in order to determine the improvements needed. 

The personnel worker may first enumerate his goals and projects and 
then attempt to assess his procedures and accomplishments. If he cannot 
know the implications of the outcome of his work or evaluate them against 
the larger world of knowledge, he at least owes to himself the testing of 
his hypotheses in order that his students and his school may aspire to 
attain the goals set forth in the program, (Mueller, 1961). 

The efective administrator expects continuous reorganization and 
change as improvements are indicated by new and relevant research or 
evaluation of current programs. The empirical and evaluative approach to 
organizations and structures would seem to be necessary to avoid the 
danger of maintaining an inappropriate and outmoded program (William- 
son. 1961). 

What are the changes taking place about which personnel workers need 
to be aware when they re-examine and subsequently modify their goals 
and activities? Wise (1961) has summarized them as follows: (a) exten- 
sion of educational opportunities to increasing numbers of the population, 
(b) a rise in the rate of marriage on campus, (c) a substantial shift in 
academic interests among college and university students in the United 
States, (d) an apparent improvement in the quality of entering freshman 
students, (e) the existence of a tighter curriculum and of higher standards 
in elementary and secondary schools, (f) the presence of more amiable 
relations between faculty and students, and (g) a projected increase in 


budgets for student personnel work. 


A Look Ahead 


ext three years those undertaking studies of 
college personnel programs will draw heavily on theory, on methodology, 
and on findings represented in research recently completed and presently 
being conducted by sociologists, social anthropologists, and social psy- 
chologists. Such research would appear to offer much at both conceptual 


It is hoped that in the n 


and practical levels. 
Additional References: Becker (1961); McConnell (1962); Sanford 


(1962) ; Sprague (1960). 
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CHAPTER IV 


Selection, Preparation, and Professionalization 
of Specialists 


ROBERT O. STRIPLING and JAMES L. LISTER * 
e 


Is THE April 1960 Review, Hill and Green expressed a hope that short- 
ages of personnel workers would result in more and better research 
publications on selection, preparation, and professionalization. While a 
review of the literature does indicate an increase in the number of publica- 
tions relating to these topics, there does not seem to have been a signifi- 
cant increase in the reporting of basic research. 


Selection of Personnel Specialists 


Santavicca (1959) reported that 85 percent of the counselor education 
programs surveyed considered the undergraduate record of applicants. 
Other considerations were scholastic ability, teaching and work experience, 
personal adjustment, and social interests. The lack of research reported 
on selection may support Hill's (1961a) statement that while other factors 
are important, initial admission is largely contingent upon admission to 
graduate school. Emphasizing that counselors are drawn largely from the 
teaching ranks, Hill (1961b) characterized selection as a process extend- 
ing from a teacher's evidenced interest in guidance work to his evaluation 
during a practicum experience. 

Writers agreed upon the importance as well as the difficulty of assessing 
counselor personality characteristics. Patterson (1962) studied the char- 
acteristics of vocational rehabilitation counselor students as measured by 
the Miller Analogies Test. Kerr and Sperofl’s Empathy Test, the Edwards 
Personal Preference Schedule, the Minnesota Multiphasic Personality 
Inventory, and the Strong Vocational Interest Blank. The personality char- 
acteristics of National Defense Education Act (NDEA) guidance institute 
enrollees were studied by McDougall and Reitan (1961), who used a 
peer-rating technique. and by Kazienko and Neidt (1962), who differ- 
entiated between good and poor enrollees through self-descriptions on the 
Bennett Polydiagnostic Index. Giblette (1960) found tests of empathy, 
emoticaal stability, and practical judgment useful in differentiating among 
three levels of rated competency of practicing school counselors. Gazda 
and Ohlsen (1961) reported that prospective counselors appeared less 
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well adjusted after seven weeks of group counseling: 14 months po 
they seemed somewhat improved. Tyler (1960) described selection ec 
niques used in the first 50 NDEA institutes. In addition to the minimum 
qualifications stipulated in the Act, institute staffs were inpr Ria ^n t 
develop their own assessment techniques. Standardized tests of academic 
aptitude, personality, attitudes, and values were employed. 


Preparation of Counselors 


Counselor preparation has received no greater impetus than that pro- 
vided by the NDEA. Tyler (1960) described the guiding principles, struc- 
ture, and contributions of the initial 50 institutes. Seventy-one 
the directors surveyed by Klopf and Cohen (1962) 
of the institute programs had had 
education programs, 

Riccio (1961) compared counselor-edu 
visors who were members of the Associ 
Supervision (ACES). He considered level of preparation, relationship 
between geographical area in which the degree was gr 
current employment, degree-granting institutions, 
of doctorates earned. Polmantier and Schmidt ( 
hensive survey of courses offered by state uni 


to prepare guidance workers, Of the 54. univ 
courses in all 13 areas ment 
Most of the work inves 


percent. of 
reported that the impact 
an effect on their regular counselor 


cators and state guidance super- 
ation for Counselor Education and 


anted and area of 
and trends in the type 
1960) reported a compre- 
versities in areas designed 


ersities surveyed, none offered 
ioned in the inquiry. 


tigating the status of 


degree in counseling, with approximately 30 ho f credit, Harmon and 
Arnold (1960) asked ine RU, REIS Han 


Few of the counselor education ed included any system- 
alic evaluation of behavior changes in counselor candidates, A notable 
exception was the study by Munger and Johnson (1960) of attitudinal 
changes associated with participation in a summer NDEA guidance insti- 
tute. Enrollees evidenced an increase i 


n the use of understanding response? 
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and a decrease in the use of diagnostic and probing responses in their 
interactions with clients. 

This period witnessed increasing debate over content areas in counselor 
preparation (Hoppock. 1961). One of the outstanding controversies cen- 
tered around the desirability of requiring teacher certification and/or teach- 
mg experience for counselors. Hoyt (1961) and Hudson (1961) main- 
tained that teaching experience was essential, but Cohen (1961) and 
Lloyd-Jones (1962) recommended experimentation to gather the needed 
evidence. Cohen outlined a pilot research program, and Lloyd-Jones pro- 
posed the organization of experimental counselor education programs for 
promising candidates from nonteaching curriculums. Sherman (1962) 
discussed the merits of generalist versus specialist approaches to counselor 
Preparation, and Hutson (1962) recommended a teaching apprenticeship 
for preservice counselors. Wrenn (1962) reported that none of the 
32 counsclor-educators participating in the Commission on Guidance in 
American Schools’ study mentioned teaching experience or any other 
particular experience when asked what the emphases would be upon coun- 
selor competencies and preparation 20 years later. 

Walz and Roeber (1962) asked counselor-educators to respond to a 
typescript of a counseling interview. Responses indicated that supervision 
might be largely cognitive, information-giving, and concerned with coun- 
selor verbalization, premature evaluations, and variation in supervisory 
behavior. Schmidt (1960) described internship experiences at Indiana 
University, which included supervised practice in the schools for employed 
teachers as well as preservice counselors. Ivey (1961) described a summer 
guidance practicum at Bucknell University, which emphasized testing and 
test interpretation. The University of Minnesota, Counselor Education Staff 
(1960), described a supervised experience that included field practice in 
addition to a counseling practicum. The San Diego State College intern- 
ship reported by Malcolm (1961) stressed the aspects of service to the 
schools as well as professional preparation. Secondary students who served 
as clients reported high satisfaction with the experience. 

This period saw growing concern for the professionalization of coun- 
selors, accompanied by increasing dissatisfaction with preparation pro- 
grams. These concerns were highlighted by the publication of the Wrenn 
(1962) report. , 

The Counselor in a Changing World, while written by Wrenn (1962), 
was an outgrowth of deliberations by the Commission on Guidance in 
American Schools. Presenting recommendations for counselor preparation 
against a background of social, cultural, scientific, and educational changes, 
Wrenn’s work has intentionally resulted in critical discussions. Winfrey 
(1962) reported ASCA members’ reactions to 23 recommendations from 
the Wrenn report and stressed the need for clarification of counselor 
function and preparation. 

Increasing concern over standards for counselor education as well as 
the need for clarification of counselor role, functions, and conditions of 


178 


— 5 
REVIEW OF EDUCATIONAL RESEARCH f Val, AXATI, Xo. 


work led to the launching of the studies by the Association for Counselor 
Education and Supervision and American School Counselor p Sig 
(1962). Involving regional chairmen and local study committees, the goa ; 
of these national studies are to provide standards of preparation m 
descriptions of counselor role, functions, and conditions of work that wi 
ultimately receive endorsement by the parent organization, the American 
Personnel and Guidance Association (APGA). The interim report of the 
Association for Counselor Education and Supervision and American School 
Counselor Association (1962) contains position and reaction papers pre- 
sented at the 1962 APGA convention. In addition to these major efforts by 
committees and organizations, such individuals as Hitchcock (1962). Dugan 
(1961), McCully (1962), Feder (1961), and Pierson and Grant (1959) 
emphasized the need for decisive steps toward professionalization. 

Schmidt (1962) studied counselor role perceptions held by school coun- 
selors and their principals. Using a Q-sort technique, he compared inter- 
counselor and principal-counselor agreement on actual 
functions. While many areas of significant agreeme 
observed relationships, which failed to account for 
variation, suggested that areas of substantial disagr 


and ideal counsclor 
nt were noted, the 
much of the existing 
eement remained. 


Professionalization of Counselors in 
Educational Settings 


Walker (1961) reported the efforts made by Akron, Ohio, 
alleviate their shortage of professionall 


service education. Jean Wellington ( 
graduate students from Tufts Univ 
Massachusetts, guidance dep 
on-the-job education progra 
strongly supported by schoo 


schools to 
y qualified counselors through in- 
1961) described efforts to involve 
ersity in the program of the Winchester- 
artment. Camp (1959) described 
m in the Baltimore County scho 


l personnel. A questionnaire surv 
gura (1960) of 32 New Jersey junior high schools revealed 


of the approved schools were actually conductin 
programs. Of the approaches describ 
in all schools. Evaluations of these p 

A reliable index of increased prof 


a systematic 
ols that was 
ey by Buona- 
that one-third 
g in-service education 
ed, only the faculty meeting was used 
rograms were highly subjective. 
"uw olessionalization of any group of special- 
ists is its movement to adopt certification standards. The latest U.S. Office 
of Education report (Brewster, 1960) listed 37 states with mandatory 
certification requirements and 4 With optional requirements, compared 
with 34 mandatory and 7 optional, respectively, in the in dus report 
(Brewster, 1957) . While certification requirements have increased, it should 
be noted that specific requirements vary greatly even among states; further- 
more, one may question, in the absence of evidence, how many employed 
counselors actually satisfy these minimum requirements, Also, it should be 
noted that the essential question concerning quality of preparation has not 
been examined. Lloyd (1962) compared attitudes of counselors and 
174 
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counselor-educators toward the mandatory certification requirements of 
34 states. Both groups expressed dissatisfaction with certain aspects of 
existing requirements as well as disagreement regarding desirable certifica- 
tion. This led Lloyd to note the need for research on the adequacy of 
expressed opinions as well as their most effective implementation. 
: While the number of states with mandatory certification requirements 
is increasing, such standards may not really provide a satisfactory basis 
for quality control (Dugan, 1902). Dugan further noted that reciprocity 
among states was impossible until morê common bases for certification 
were nationally recognized. He recommended the initiation of a national 
cooperative study of standards. McCully (1961) questioned whether the 
school counselor's functions were sufficiently unique to justify certification; 
he cited the absence of certification requirements in some states as evidence 
that the functions performed by the school counselor lack sufficient recogni- 
tion to warrant nationwide certification. 
ueri for ethical behavior of personnel workers was evidenced at the 
i PGA convention by the adoption of a code of ethics as set forth by 
Arsenian (1961). The adoption of this code may well be another major 
step in the direction of professionalization of counselors. 

Although the legal position of counselors remains unclear, Garber (1960) 
stated that the Iverson case is a step in the direction of needed clarification. 
Miller and Simpson (1961) presented a review of the legal aspects of the 
counseling relationship under the headings of expert witness, professional 
immunity, and counselor liability. 

While the exact ratio of counselors to students in the public schools 
remains a controversial issue, few doubt the existence of a serious shortage 
of counselors. Crummel (1961) estimated that there is a shortage of more 
than 18,000. Dugan (1961) estimated that at least 5,000 qualified new 
counselors will be needed each year through 1970 and suggested selecting 
prospective counselors from a variety of academic backgrounds. 

Many factors seem to contribute to the wide range of salaries. Wrenn 
(1962) gave a median salary of approximately $6,000 for the counselors 
in Project Talent. In this sample, 9 percent earned less than $4,000 and 
10 percent, over $8,000. It should be noted that 47 percent of these coun- 


selors were employed for school terms of 10 months. 


Preparation and Professionalization of Other Specialists 


Little research appropriate for this chapter has been reported concerning 
the preparation and professionalization of specialists other than counselors. 
The scarcity of publications considering such specialists as the school 


ker, and health worker from the broad pupil per- 


psychologist, social wor 
has not gained 


sonnel point of view may mean that such a point of view 
widespread acceptance yet. 

In considering personnel specialists in broad pe 
minimum of evidence on the incidence of such specialists in the public 
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schools. An exception was a stalistical report of the 
Association (1961), 
» Den call Berg (1962) have been explicit in differentiating between. the 
roles of the school nurse and the teacher in the area of school health. T hey 
emphasized that while their respective roles are distinctive, they are both 
members of the school health team. Klein (1959), who was more explicit 
about the guidance functions of the nurse. pointed out that health counsel- 
ing was given in 82 percent of the public schools surveyed, She also 


reported that organized guidance programs existed in about one-half of 
the public schools employing nurses, and that, in the 


usually considered members of the guidance team, 
A statement on functions and qualifications of school nurses 


the American Nurses’ Association, Public Health Nurses, 
Branch (1961), stressed understandi 


mastery of specific nursing skills, 
y p E 


American Nurses? 
which showed approximately 10,000 nurses employed 


se schools, nurses were 


adopted by 
School Nurses 
ng of school and community as well 
Specific functions relating to counseling 
and guidance were directed toward the elimination or minimizing of health 
problems that interfere with learning and development, 
In discussing the social worker’s function in the school, Nebo (1960) 
emphasized the importance of an interdisciplinary and interprofessional 
approach involving a broad array of school and community personnel. 


as 


Summary 


Research on selecti ion, and professionaliz 
specialists remains lim; rveys and position 
and desirable concomi- 
ee them soon paralleled 


€s are inevitable 


tants of professiona th, one would hope to s 


l grow 
by basic research, 
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CHAPTER V 


Counseling 
ROBERT CALLIS 


o 


mm 

Tas RESEARCH on Counseling reported in the three-year period covered 
by this Review has been extensive and meaningful. With few exceptions, 
the methodology employed was sophisticated and appropriate for the task, 
and the researchers seemed to be genuinely interested in discovering new 
knowledge rather than in proving theoretical convictions. 


Theory Construction 


It was heartening to see several articles in which attempts were made 
to synthesize and to integrate theories pertaining to counseling. In Callis’ 
(1960) article, the key concept was that clients who seek assistance from 
counselors have some current or anticipated inadequacy in their behavior 
repertoire. Inadequacies could be caused by (a) lack of experience, (b) 
distorted perceptions, and (c) erroneous generalizations. The task of coun- 
seling is to discover inadequacies and ways of correcting them. These 
discoveries can be made by either the client or the counselor or by both. 
Two general methods of counseling were postulated from (a) client self- 
discovery and (b) counselor discovery and the interpretation of the dis- 
covery to the client. 

In Layton’s (1961) model, the principal ingredients were (a) external 
reality, (b) perceptions of self and others, and (c) communication between 
reality and the perceived self. Perceptions and possible distortion in per- 
ceptions are central concepts. One of the goals of counseling is to bring 
the client’s self-perception into closer congruence with reality. It would 
not be difficult to blend the notions of Callis and Layton into a single 
theoretical model. Also, many of the notions of self-theorists could be 
t would appear then that the two constructs— 


encompassed in this model. I l 
lity—are becoming embedded firmly in the 


perception and external rea 1 
theoretical formulations about counseling. 
Additional Reference: Weitz (1961). 


Avowed Goals 


As has been either stated or implied in the previously described theo- 
retical formulations, the avowed goal of counseling is to assist the client in 
changing his behavior repertoire so that he can behave in a more effective 
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and personally satisfying manner. Tyler (1900), in differentiating coun- 
seling from psychotherapy, stated that the goal of counseling is to bring 
about better utilization of the personal resources that the client already 
possesses. She indicated that the object of counseling is not a major 
reorganization of personality but a minimum change in psychological 
direction. To accomplish this, it may be necessary to reduce the client's 
anxiety and thereby to increase the client's problem-solving behavior. 
Berdie (1959) indicated that the goal of counseling is behavioral change. 
He suggested not a sweeping personality change but the solving of immedi- 
ate, specific problems, which, in and of itself, may be suflicient to remobilize 
the client. He agreed with Tyler when he indicated that the reduction of 
anxiety and of defense mechanisms may he a necessary prelude to increas- 
ing the problem-solving skills of the client. Berdie cited some behavioral 
changes that may be the specific goals of specific clients, e.g., higher 
grades in school or college, longer persistence in an education program 
(fewer dropouts), more accurate understanding of self, and greater self- 
acceptance. Apostal (1960) and Gonyea (1962) proposed appropriateness 


of occupation engaged in (or planned for) as an outcome criterion for 
certain cases. 


Additional References: Curran (1960) ; Goodstein and Grigg (1959) ; 
Samler (1960). 


Obtained Results 


Educational 


Goodstein and Crites (1961) studied the eff 
grades earned by low-abilit he experimental group was 
given brief counseling; group was not. Difference in grade 
point average earned by the two groups favored the control group for the 
es "cio Se qi period) and for the following semester: 

ichardson conducted a similar 
AAPEA faa arer ri study and found that the counseled 


o 
bers were afforded three interviews each during the fall semester; the con- 
trol group members were afforded none, The interviews, conducted by the 
director of student activities of the college, consisted vitari] of inform- 
ing the student about his academic abilit : 
the end of the semester, the two groups wi 
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criteria. There were no significant differences between the two groups. 
Searles reported that many of the students in the experimental group ex- 
pressed no interest in or need for the counseling they received. 

. Baymur and Patterson (1960) identified 32 underachieving high school 
juniors and assigned them to four matched groups of eight students each. 
Group A received individual counseling, 10-12 interviews per student. 
Group B received group counseling for 9 sessions. Group C received a one- 
session group experience in which the stddents were informed that they were 
underachievers and were exhorted to do something about it. Group D 
received no treatment. Criterion data, obtained at the beginning and at the 
end of the experimental period of one semester, were (a) a measurement 
of personal adjustment, (b) a measure of study habits and attitudes, and 
(c) the change in grade point average between the experimental semester 
and the previous semester. The four groups did not differ significantly on 
any of the criteria. However, a comparison of the combined counseled 
groups with the combined noncounseled groups indicated that the counseled 
groups made significant gains in personal adjustment and grade point 
average. 

Watson (1961) studied the eflect of counseling on college students who 
had been referred for counseling because of personal-emotional problems. 
The 66 students in Group A accepted the referral and made use of counsel- 
ing (a median of 10 interviews) ; the 45 students in Group B denied any 
need for help to the intake interviewer; the 23 students in Group c made 
no response to the offer of counseling. The three groups were approximately 
equal in academic ability and in academic performance at the time of 
referral. Two years later, 68 percent of Group A, 51 percent of Group B. 
and 43 percent of Group C had either been graduated from or were still 
enrolled in a degree program. However, differences in motivation must be 
held, along with counseling, as à contributing factor to the observed 
criterion difference. 2. 

Shepherd (1962) used graduation from college as the criterion for 
al group consisted of 295 students who had 
requested and received counseling at some time during their college career, 
The control group consisted of classmates of the experimental group 
matched by sex, academic ability, and academic division. Five years after 
the subjects had enrolled as freshmen, Shepherd ascertained that about 
50 percent of the counseled group had been graduated, while about 37 per- 
cent of the control group had been graduated. There was no significant 
difference in academic ability between the graduates and the nongraduates. 
As in the case of Watson’s (1961) study, possible motivational differences 


between the two groups were not controlled. 


counseling. The experiment 


Vocational 


Apostal (1960) compared the judgments of counseling adequacy based 
on case-folder data with the judged appropriateness of occupation engaged 
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in approximately seven years after counseling and found no relationship 
between these two criteria. Gonyea (1962) found that a significant increase 
in appropriateness of vocational choice had occurred between the time that 
counseling had begun and terminated. However, there was some question 
about whether this increase was still in effect a year after counseling had 
terminated. 

Lurie, Goldfein, and Baxt (1960) studied the effect of vocational coun- 
seling on a group of underachieviiig ninth graders. The experimental group 
was provided individual counseling and extensive group activities with 
heavy emphasis on orientation to the occupational world. At the end of a 
year, they found that the experimental group showed more improvement in 
appropriateness of occupational plans and had a lower dropout rate than 
did a no-treatment matched control group. 

Williams (1962) used different criteria for evaluating the outcomes of 
brief educational-vocational counseling, an index of personal adjustment. 
and an index of congruence between actual and ideal self. He used three 
groups—an experimental group who received typical vocational counseling 
of approximately two interviews conducted by an experienced counselor; a 
client-control group who had requested counseling, but were placed on a 
waiting list for the duration of the experimental period; 
group. He found that counseling restored a normal lev 
ment and degree of congruence for the experiment 
change was observed in the control groups. These improvements were in 
effect a year after counseling had terminated. 

Additional Reference: Thompson (1960). 


and a nonclient 
el of personal adjust- 
al group. However, no 


Personal-Social-Emotional 


Davis (1959) studied the effect of group guidance and individual coun- 
seling on citizenship behavior. From a group of 70 high school seniors. 
Davis identified the 30 with the lowest citizenship evades for the previous 
year and assigned these 30 at r | 


i j andom to three groups. Members of Group 
A were provided 20 periods of group guidance: Group B. 2 individual 


counseling interviews: and Group C, no treatment. Group C showed no 
improvement in citizenship grades during the experimental period. Groups 
A and B showed significant improvement, with Group A exhibiting the 
most improvement. Each subject in Group A, of course, received 10 times 


as much counselor time as did each subject in Group B. But the intriguing 
aspect of this comparison was that the counsel ie 


amonnto? dine on aug À enden Dus or himself spent an equal 

Kemp (1961) studied the effect of dogma 
open-mindedness) on counseling, Tw 
dogmatism) were given four indiv 
10-week period. The two control 
ceived no treatment. Neither of tl 
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tism (closed-mindedness versus 
vo experimental groups (high and low 
idual counseling interviews during à 
groups (high and low dogmatism) re- 
1e control groups made any significant 
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change in score on a problem check list. The high-dogmatic experimental 
group made no change in scores on the problem check list, but the low- 
dogmatic experimental group did. Again. a forced counseling situation 
4 exists, but in this study one may have a clue to what kind of person will 


profit from counseling under such conditions. 

: Arbuckle and Boy (1961) studied the effect of client-centered counsel- 
ing on high school students with behavior problems. Thirty-six students 
who had been placed on after-school disciplinary detention were divided 
into three matched, groups. Members of Group A were released from 
detention on the condition that they report for counseling for a 12-week 
period; Group B. continued on detention and received no counseling : 
Group C, released from detention and received no counseling. At the end 
of the experimental period. Group A showed an increase in congruence 
hetween ideal self and actual self and a favorable increase in teachers” 
behavior ratings. Groups B and C showed no such change. 


Additional References: Baker (1960); Hunt and others (1959) ; Zax 
and Klein (1960). 


Diagnosis 


In most of the studies cited in this chapter. crude but obvious diagnostic 
classification schemes have been employed, often grudgingly. Tyler (1960. 
1961) reluctantly decided that diagnosis is necessary, but preferred to use 
a different label. Arbuckle (1961) frankly rejected the whole idea of 
diagnosis in counseling- Apostal and Miller (1959) described a two-dimen- 
sional diagnostic classification plan, which was subsequently used by 
Shepherd (1962) and by Kirk (1962). Shepherd found that the graduation 

depending on the diagnosis. Kirk found 


rate of clients varied sharply, 
that the problems presented to high school counselors by clients differed 
only in minor respects from those presented to college counselors. As 


pointed out by Apostal (1960), there js a clear danger of masking signifi- 
cant results if different kinds of counseling cases are not studied separately. 
Additional References: Kirk (1959) : Weiner (1959). 


Methods 


Wrenn (1960) studied the effect of theoretical conviction on counselor 
responses and found that the only difference associated with theoretical 
orientation was that psychoanalytically oriented counselors used reflection 
less frequently than other counselors. On the contrary, the principal varia- 
tions in counselor behavior were associated with the topic being discussed. 
Brams (1961) found that the only personality characteristic of counseling 
trainees related to their ability to communicate with clients was a tolerance 


for ambiguity. 
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Hoffman (1959) developed a system for specifying and classifying 
counselor interview behavior by identifying several roles that counselors 
use in interviews, such as teaching and listening. - 

Frank and Sweetland (1962) studied the effects. of different kinds of 
counselor response on subsequent client response. Clarification of feeling 
responses was followed by client responses of understanding and insight. 
Counselor interpretation also gave rise to an increase in understanding and 
insight responses. These findings raise serious questions about earlier 
formulations that suggested that the reflec 


g tion and clarification of feeling 
are the only counselor responses that can produce client understanding and 


insight. 

The extensive literature on laboratory studies dealing with how one per- 
son can influence the behavior of another has been summarized by Krasner 
(1958), Schachter (1959), and Thibaut and Kelley (1959). McDavid 
(1959) found that group-oriented people were influenced to change more 
often than were task-oriented people. Staples and Walters (1961) noted 
that firstborns were more acquiescent and suggestable, Masling (1960) 
cited many studies indicating that the personality or behavior of the 
examiner significantly influenced the nature of projective 
given by the subject. 

Russell (1961) coached his clients so that some were negative and some 
positive in their approaches to counselors, The negative clients generated 
more counselor anxiety than did the positive ones, 

Martin, Lundy, and Lewin (1960) had one group of subjects talk to a 
tape recorder while no one else was present. Another group talked to a 
counselor who remained silent, and a thir group talked to a counselor who 
responded. Compared with the other ty oups, the third group evidenced 
greater approach beh anda greater decrease in 


ume dm i ue instructed. interviewers to 
reese tainly memor ects and nonfamily mem- 


as found to be effective in 
y other studies clearly indi- 
n can influence the verbal 


lest. responses 


and others (1960) ; Grigg (1961) : Heil- 
and Robinson (1960) ; Pumroy 


Summary 


a During i. e pus Years, progress has been made in clarifyir.g the 

omain m ; aarti ong and in developing theories to explain the 
process. Althoug the evidence is not conclusive, it strongly suggests that 
unless clients perceive a need for assistance from a counselor the counselor 
is apt to have little impact on client beh ; 


ehavior. Trends in the data indicate 
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that experienced counselors may produce better results than neophytes and 
that most experienced counselors choose their methods to fit the task rather 
than a stereotyped school of conviction. Laboratory studies of counseling 
prototypes clearly indicate that it is possible for one person to influence 


i x 
the behavior of another in an intended direction. 
Considerable clarification is still needed in the employment of outcome 
^ criteria. In. many instances, the outcome against which counseling was 


evaluated was not the intended goal of the counselor but an incidental by- 
product. Knowledge, of definitive cause-and-effect relationships in the coun- 
seling process would be greatly increased if research data were sorted into 
homogeneous classifications by some kind of diagnostic formulations 
applied to client and counselor characteristics and behavior. 
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CHAPTER VI 


The Appraisal Function 


LEO GOLDMAN 


Tins CHAPTER is restricted to publications that were concerned with or 
particularly relevant to the appraisal function itself. Excluded are most 
studies regarding the reliabilities, validities, and norms of tests. Although 
several of these studies contain important substantive and methodological 
contributions in the area of appraisal, many of them have been reviewed 
in the February 1962 Review, Educational and Psychological Testing. 
Although a somewhat more inclusive approach was taken with nontest 
methods of appraisal, only a very few items were located, 

General treatments of the appraisal function, including references to 
relevant research, were found in Cottle and Downie (1960) and in Goldman 
(1961). The three issues of the Annual Review of Psychology (1960, 1961. 
1962) that almost correspond to the period covered by this issue of the 
Review had no separate chapters on counseling, although some pertinent 
material was included under other chapter headings. 

The various reviews continued to reflect pessimism regarding the validi- 
ties of tests and of other sources of appraisal data. In particular, there 


was criticism of the lack of theoretical foundations for much of the work 
with tests. 


Test Taking and Its Preparation 


Instructions 


Hills (1961) compared the results on alternate forms of the California 
Test of Personality when subjects were told to anay 


administration and deliberately at another 
found, it was not clear to what : 
with a lack of comparability in t 
forms of the inventory. 


er rapidly at one 
Although differences were 
might have been associate 
actorial composition of the alternate 


extent they 
he f 


Practice 


In their study of practice effects i 
boltz and Christal (1960) discover, 
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effects to be minimized, use of different types of spatial tests with unlike 
specific variance would be necessary. 


Perception 


The subjects perceptions of a test and of its significance would be 
expected to influence his set, the effort he expended in responding to the 
items, and, therefore, his scores. Wrightsman (1960) found no significant 
differences in average score between those subjects who had been told 
that the test was a valid measure of intelligence and those subjects who had 
been told that it was not valid. Moreover, there was no difference between 
the two groups in the extent to which they took advantage of an opportunity 
to cheat through using not-well-erased answers in the test booklets. It 
would be interesting to know how individual subjects really perceived 
the test, as an unimportant laboratory exercise or as a test that was 
measuring something important to their self-concept. 


Sereening 


eness of personality instru- 
(Billard, 1962) and college 
rs, 1961; Kleinmuntz, 1960, 
ality 


Several investigators reported on the effectiv 
ments in the screening of high school students 
students (Caldwell, 1959; Goodstein and othe 
1961; Parker, 1961a, b). There was general agreement that the person 
profile at the time of counseling reflected more disturbance than the one 
taken during the orientation period. There was disagreement concerning the 
identifiability at screening time of those who later would seek counseling. 
To some extent. this circumstance might stem from the different kinds of 


students that different counseling centers were attracting. 


Scoring and Score Conversions 


noted serious errors in machine scoring of the Strong 
VIB). When seven men’s SVB answer sheets 
Were scored a second time by the same machine scoring agency, 35 per- 
cent of the occupational scores changed by at least one score level. Rescor- 
ing of two women’s SVIB answer sheets yielded changes on 52 percent of 
the scales, Even after the agency reported that the machines had been 
adjusted, 22 percent of the scores on a single answer sheet changed on 
Tescoring. 

The’ importance of loca 
though Goodstein and others 
cal Inventory differentiated amo 
Students on a campus, the scores o 
from the published norms for co 


>  ) 


Burack (1961) 
Vocational Interest Blank (S 


] norms was reinforced in several studies. A]. 
(1961) found that the California Psychologi- 
ng counseling center clients and other 
f the latter group departed substantially 
llege students. An interchange between 
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Kleinmuntz (1960, 1961) and Parker (1961a, b) led to the conclusion that 
different cutting scores for the Minnesota Multiphasic Personality Inventory 
were needed at two different universities on a scale of maladjustment. 


Nontest Methods 


As usual, there were very few reports of research with nontest methods 
of appraisal. The gap between possible and actual practice was scen in 
Hutson's (1960) summary of several dissertations on character-trait 
ratings. Several of these studies showed that carefully designed. properly 
used rating scales yielded relatively high correlations between such traits 
as reliability and industry and high school grades and between these same 
traits and college grades. On the other hand, a study of the rating practices 
of 100 high schools judged to be among the best in the United States led 1o 
the conclusion that most such plans violated the sound principles of rating 
that were established in the 1920’s. 

Weiss and Dawis (1960) studied the accuracy of information obtained 
in interviews with 48 physically handicapped clients of the Division of 
Vocational Rehabilitation and with 43 Employment Service applicants. 
When agency records and employer reports were used as criteria, the 
accuracy of the information varied from 100 percent for sex identification 
to 45 percent for the question on whether the person had received any 
assistance from the agency. It should be noted that the interviews were 
conducted not by counselors but by opinion-polling intery 
ently had visited the homes. The results might h 
information had been obt 


iewers who appar- 
ave been different if the 
ained during counseling interviews. 


Interpreting Data 


The largest number of studies rel 
on the interpretation of dat 
about a person or group. 


ating to the appraisal function touched 
a—that process hy which inferences are made 


Expectancy Studies 


Although no major methodological research was located, several studie? 
reflected a small increase in the use of established methods of producing 
specific expectancy data. One of the most ambitious projects was a state 
wide program for relating high school grades and college aptitude test 
scores to college grades (Hills and others, 1961), The product was 4 
manual for school counselors, which provided tables of norms and ? 
regression equations for each of the cooperating colleges and universitie® 
in Georgia. li i 
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The College Entrance Examination Board (1963) made available to 
counselors a valuable compendium of normative data about the freshman 
classes at 250 member colleges. The presentations were not all uniform, 
and not all colleges furnished complete information. The data included 
distributions of scores on the Scholastic Aptitude Test of the College En- 
trance Examination Board and of high school rank of applicants as well as 
distributions on the same variables for accepted students and for scholar- 
ship winners. Despite its limitations, this’kind of information should be a 
boon to high school counselors who have the ability and understanding 
necessary to use it properly. . 

One of the major avenues of improvement in data interpretation has 
heen the attention given to determination of local norms. Kaczkowski 
(1959), for example. described methods of preparing local expectancy 
tables, Juola (1961) used college freshmen data to illustrate several kinds 
of expectancy tables. several of which showed the interaction of two or 
three predictor variables and one criterion variable. One high school fol- 
lowed up its graduates into college and prepared distributions of high 
school rank for each college (Leussler, 1961). Four small schools collabo- 
rated in preparing expectancy tables that demonstrated the relationship 
between scores on an aptitude test for engineering and physical science 
and grades in an engineering college within a university (Oliver, 1960). 


Counselor Versus Statistical Prediction 


IB predicted the job satisfaction of engineers 
an did the judgment of any one of six college 
on the use of one or more other scales of 
1961). The SV’/B Group IV scale, which 


is considered suitable for technicians, correlated .50 with the reported job 
satisfaction of engineers. The counselors seemed to weight the scale used 
for engineers more heavily than the one for technicians. Brayfield’s (1961) 
comment on the article suggested an explanation for the outcomes of the 
investigation. Subjects who were low in occupational self-concept (the 
Group IV. scale typically correlates. negatively with occupational level) 
Were more satisfied with the occupational status that they attained as engi- 
neers than were those individuals whose expectations had been higher. If 
this hypothesis is true, it would appear that the failure of these counselors. 
as that of so many other counselors in studies of clinical versus statistical 


methods. stemmed from inadequate understanding and conceptualization 


of the worker-situation interaction. 


A single scale on the SV 
with far greater accuracy th 
counselors, who seemed to rely 
the SVIB (Lewis and MacKinney, 


Strong Vocational Interest Blank 


Stephenson (1961a) developed a method for classifying groups of scales 
on the SVIB that appeared to be more reliable than the Darley-Hagenah 
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method. A chart that listed all possible combinations of letter grades on 
the scales in each group served to reduce the number of cases in which 
arbitrary classifications could be made. 

Stephenson (1961b, 1962) also investigated the chance scores or gray 
areas on the SV IB. The first study demonstrated that gray-area scores were 
similar in retest reliability to other scores. Therefore, Stephenson con- 
cluded that gray-area scores were not likely to be the result of chance, but 
rather a consequence of low interest of the subjects relative to people in 
the occupation. The second study was less convincing. Subjects taking the 
SVIB on a retest were asked to fake in the direction of obtaining higher 
scores or lower scores as well as to fake in obtaining chance scores for a 
stated occupation. Subjects could fake successfully in attaining higher or 
lower scores but not in achieving chance scores. One questions the possi- 
bility of the task assigned the fake-chance subjects. A person can fake in 
the direction of a high or low score by visualizing the likes and dislikes 
of people in an occupation, but how does he visualize the chance responses 
of people in an occupation? 

Filbeck and Callis (1961) developed a verification scale for the SV1B 
that included 103 rare item-responses. The scale differentiated genuine 
from theoretical chance profiles on a cross-validation study, Reliability and 
normative data were also presented, as was the scale itself, 

Brown (1961) pointed out that a common error in interpretation of the 
SVIB results from misunderstanding the expectancy ratio. Strong’s conclu- 
sion that chances are 3.6 to 1 that a man with an A rating on an occupa- 
tional scale will enter a specific occupation is applicable only to samples 


that are similar to Strong’s sample in terms of the percentages of men who 
later enter each occupation in question. 


Some long-range validity data led Berdie (1960) to suggest that careful 
use of the SVB with high school seniors is justified zu 


Differential Predictability 


Two studies shed further light on individual differences in the predicta- 
bility of criteria from the same 


S t predictor variables. Berdie (1961) found 
some evidence, though inconclus 


: n ive, that a person's variability of response 
to the items on an ability test was negatively related to the predictability 


of his grades in college. Ghiselli (1960) continued his contributions in this 
area by describing procedures for determining which of two valid measures 
is the better predictor for a given individual. 

These two studies continued a line of research which may prove exceed- 
ingly valuable in both theoretical and applied outcomes. Not only may one 
expect improvement in the predictive efficiency of test and other data. 
but also one may gain increased unders c 


À t tanding of the dynamic interaction 
of various personal and environmental factors, 
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Miscellaneous 


Katz (1962) explained that a profile on the Kuder Preference Record 
(KPR) is the result of both ipsative (intra-individual) and normative 
(interindividual) comparisons. He concluded that since one cannot legiti- 
mately evaluate a KPR profile in either ipsative or normative terms, the 
interpretability of this inventory is open to question. 

Knox and Hoffman (1962) found thas the graphic dimensions of a test 
profile influenced interpretation by college students. In the analysis of data 
obtained, broader variance in judgments, higher reliability, and greater 
coefficient of multiple correlation were associated with the presence of the 
relatively wide spread of scores on the percentile graph than with the 
appearance of a relatively narrow range of scores on the standard-score 
graph. 

Discrepancy scores between amount of detail on the Draw-a-Man test and 
the Ohio State University Psychological Test (OSUPT) did produce a 
higher degree of prediction of scholastic achievement for counseled proba- 
tion students than did the OSUPT alone (Isard and Lasky, 1961). The 
authors suggested that the discrepancy score reflects the extent of integra- 
tion among a person's different abilities. 


Communicating Appraisal Information 

Although several related studies appeared, little research was reported 
which dealt directly with the process of communicating appraisal informa- 
tion to individuals or groups. This lack of published research constitutes 
an apparent decrease from earlier years. 

From a county school system in Florida, Durost (1959) reported that 
locally based stanine scores were found meaningful to teachers and parents. 
Burr (1959) described how the IBM card on which the scores were pro- 
filed represented an effective use of test results. Eells (1961) and Harris 
and Dole (1960) presented interesting graphic methods for portraying 


expectancy data. 

Those teachers who had made le 
highly a procedure for transmitting 
the cumulative records (Hopke; 1961). 


ast use of guidance records rated most 
to them a summary of data taken from 


Evaluation of Test Reports 


Again, short-term test-reporting programs were found to effect little or 
no change in student estimates of their abilities and of their anticipated 
grades (Robertson, 1960a, b). On the other hand, Herman and Zeigler 
(1961) reported that some degree of change was realized with parents of 
college students. It is questionable that these studies tapped deep-seated or 
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lasting changes. Furthermore, it would be surprising if such changes did 
occur as the result of one individual or group session. 


Conclusions 


The last few years saw a small burst of published research on the 
appraisal function, including suck topics as analysis of clinical interpreta- 
tion processes, comparison of clinical and statistical methods of data inter- 
pretation, the influence of different approaches to selecting tests in the 
counseling process, and the effectiveness of different methods of report- 
ing appraisal data and of making interpretations. Perhaps one is secing 
merely another example of what is too frequently an occurrence in educa- 
tional research: abandonment of a topic after a few studies have been 
reported. Yet the earlier studies on the appraisal function surely did not 
exhaust the subject; they merely suggested intriguing hypotheses that often 
challenged counselors to justify their questionable appraisal activities. 
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CHAPTER. VII 


Vocational Development Process 


GARRY R. WALZ 


Vocationan development as a major aréa of investigation is young and 
vigorous. If one tracés its beginnings to Super's Career Pattern Study, it 
!5 scarcely more than a decade old. In that short span, investigations of 
Vocational behavior have moved from conceptually shallow trait-and-factor 
Approaches to fertile occupational models that encompass the behavioral 
Sciences. 

Although more than 200 separate sources were pasily identifed fer Hie 
chapter, publications on vocational behavior have some of the qualities of 
an iceberg. Out of sight are the many studies and activities that provide 
the buoyancy for those that are published. The author has chosen to be 
objective rather than impartial, but much of interest and worth has been 
excluded. Throughout, articles of major theoretical or research importance 


have been f. avored. 


Career Development 


Some of the characteristics of the newer conceptual approaches given 


hy Borow (1961) in his succinct analysis of research on vocational 
evelopment were (a) production of more m pier Apos ten (b) 
"vé of Doth psychodynamics and ege psychology, and (e) preference for 
longitudinal iter designs. He calls convincingly for less research on 
the prediction of vocational choice and more on vocational striving and 


the meaning of work. In another review of research, Super, Tiedeman, and 
9row (1961) identified three issues to be resolved: (a) development of 
adequate statistical tools for career prediction, (b) construction of both 
theoretical and operational definitions of vocational behavior, and (c) 
development of comparable measures of vocational maturity at different 
age le 
t € empirical test of some of the hypotheses from Super's theories 
Was contained in an investigation of the vocational maturity of ninth 
&rade boys by Super and Overstreet (1960). Using s qe cg of jeu 
liona] maturity, they defined vocational maturity E = oe gcn 
Primarily preparation for choice and planning ahea . Super ( ) 
also found that the preferences of ninth grade boys were inadaquate as 
criteria for determining the need for or the effectiveness of counseling. 
Among Super’s significant contributions are his continuing insightful 
theorizing and the testing of his hypotheses through a longitudinal career 


“evelopment study. 
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Roe's theory linking childhood experiences to occupational interests 
was the subject of three investigations. Hagen's (1960) results did not 
generally support Roe's posited relationship between family experiences 
and later career choice. In a study of ministerial and chemistry students. 
Switzer and others (1962) found little in the students’ childhoods that 
was predictive of occupational choice. In an investigation of four groups 
of professional women, Utton (1962) found that only one Roe-based 
hypothesis was supported: social workers and occupational therapists 
manifested a greater altruistic love of people than did dietitians and 
laboratory technicians. Generally, the studies did not support Roes 
hypotheses. It is hard to generalize, however, since restricted samples and 
retrospective approaches were used. Perhaps Roe's major contribution to 
date has been to offer a series of hypotheses about family influences in 
career development, which have been a spur to research. 

Two studies were related to Holland's personality theory of vocational 
development. Holland (1960) obtained empirical support for his belief 
that preferences for occupational titles are significantly related to a 
number of personality variables. Tentative support for Holland’s contention 
that a person's level of occupational choice and aspiration is indicative 
of his evaluation of himself was obtained by Schutz and Blocher (1961)- 
Holland's aim to develop an integrating theory of vocational choice that 
is explicit enough to encourage research would seem to be a desirable 
and attainable goal. 

Operating on her premise that the core of individuality consists of ? 
person's choices and the manner in which he organizes them, Tyler (1961) 
investigated subjects’ choices, using methods from interest-measuremen! 


research and concept-formation research. Her intent w 


as to discover the 
individuality inherent in people who m 


ake common job choices. Progress 
to date has been encouraging; further validity studies lie ahead. 


Using Bordin’s theory of occupational choice, Schutz and Blocher 
(1960) found a significant relationship between vocational preference? 
and occupational stereotypes selected as self-descriptive by twelfth grade 
boys. Blocher and Schutz (1961) found that the twelfth grade students 
perceived both their self-concepts and their ideal concepts to be closer t° 
stereotypes of workers in occupations with high claimed interests tha? 


to their equivalent stereotypes of workers in occupations with little claime 
interests. 


Notable research contributions were made by Tiedeman and his €% 
workers. Their work encompassed formulation of a psychosocial model- 
development of new statistical techniques, and practical application of their 
findings. An investigation into the clerification of boys’ self-concepts W2* 
undertaken by O’Hara and Tiedeman (1959). Their results would seem t° 
demonstrate that boys’ self-concepts about work values and interests We"? 
clarified as they moved from grade 9 to grade 12, Aptitudes, however, were 
found to be relatively poorly perceived, even by able étudentg, in grade? 
9-12. Tt was hypothesized that there is an interest stage ending by grade 
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and a work-values stage extending through grade 12. Cass and Tiedeman 
(1960) found that interests were effective predictors of the selection of 
high school curriculum and that aptitudes not only were poor predictors 
of choice of curriculum but also were not even well understood until 
the senior year of high school. 

One of the most encouraging aspects of career development research 
was the use of theoretically rich models as a basis of empirical investiga- 
tions. Hopefully, within three or four years, one will see the use of broader 
populations and more longitudinal desigtis. If one can judge a theory by 
the research it fosters, it would seem that the work of Super, Roe, and 
Tiedeman could be regarded as major contributions. Bordin, Holland, 
and Tyler also served as sources for considerable research activity. 

Additional References: Borow (1960); Davis, Hagan, and Strouf 
(1962); Flanagan and others (1962) ; Holland (1961); Kibrick and 
Tiedeman (1961) ; Nachmann (1960) ; Segal (1961) ; Stephenson (1961) ; 
Stern (1961) ; Super (1960) ; Tiedeman (1961) ; Uzzell (1961). 


Occupational Psychology 


In perhaps the first long-term longitudinal study of job satisfaction, 
Hoppock (1960) conducted a 27-year follow-up of employed adults. He 
found older persons to be more satisfied, with the largest increases 1n job 
satisfaction occurring among those who changed jobs. Useful as his study 
was, additional and more adequately controlled studies need to be made 


before one can generalize. : " " 
Three studies were concerned with the relationship of personality needs 


lo occupations. Dipboye and Anderson (1961) found that high school 
seniors have occupational stereotypes that can be expressed. in terms of 
manifest needs. Nurturance, endurance, and order were perceived as needs 
Satisfied by the largest number of occupations. Gonyea (1961) used 
Andrews’ A technique to identify the dimensions by which college students 
view occupations. The usual interest and job classification grouping did 
not appear, A relation between the personality needs of individuals and the 
liking or disliking of job duties individuals are called upon to perform 
as members of a broad occupational group was discovered by Walsh 
(1959). 

In an investigation of the personality characteristics of occupational 
groups, Miller (1962) found that the groups differed on personality 
variables that appeared to be relevant to the characteristics. of the 
Occupations. The variables could be-divided into those dealing with goals 
and those related to means. 

Kinnane and Suziedelis (1962), using Super’s Work Values Inventory, 
found a high degree of relationship between work values and inventoried 
interests. It was hypothesized that an individual's patterning of interests 
reflects his values regarding work. In a study of the ordering of occupa- 
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tional values by high school students, Dipboye and Anderson (1959) 
found a tendency for all age and sex groups to rank interesting work first 
and security second. Independence was ranked low by all student gronps, 
It appeared that occupational values were formed by the ninth grade anc 
that they did not change during the remainder of high school. Both studies 
highlighted the need for further research on development of occupational 
values and the meaning of variances in values between and within occupa- 
tional groups. 

Nosow and Form (1962) edited an excellent book of readings on the 
sociology of work. Personality as it relates to occupational choice and 
job performance was a subject of an increasing number of investigations. 
Definitive research on the perceptions of occupations by people both 
within and outside an occupation are needed. There is room, too, for 
research on what the need-meeting qualities of occupations are and on 
how a worker tailors his job to meet his personal needs. 

Additional References: Crites (1962); Cureton and Katzell (1962): 
Froehlich and Wolins (1960) ; Harding and Bottenberg (1961) ; Katzell. 


Barrett, and Parker (1961) : Robinson and Connors (1960. 1961) ; Vroom 
(1962). 


Occupational Classification 


Few developments were noted in ex 
of the classification systems introduce 


Review. Norton (1962) studied the judgments of 70 workers about the 
needed capacities and education for 


ipat r jobs with the ratings given in the 
Worker Trait Requirements. He conc r rati 


t luded that while the ratings needed 
refinements, they would still be usefu 


S, d l to a counselor. 
In a critical appraisal of the Functi 


L onal Occupational Classification 
Project (FOCP), Walther (1960) demonstrated that important worker 
characteristics were not identified by use of the system proposed in the 
FOCP. He argued for a classification structure based on differences be 
tween people rather than differences in work situations. An occupation? 
system was proposed by Brender (1960) whereby the intrinsic j° 
satisfaction in each occupation would be identified and described. 
The counseling use of 


ges : ion systems calls for considerable 
counselor adaptation if emphasis is given to 


panding upon the early promise 
d in the April 1960 issue of the 


kd 
Women’s Career Development ' 


White (1959) investigated the relationsh; f 

i : ; "PE onships of self-concept and ° 
parental identification to girls Vocational interests and cocupation? 
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choices. Compared with girls secking careers, those aspiring to tradition- 
ally feminine roles were more satisfied with themselves and came from 
more stable homes, Moreover, the girls with traditional leanings did furnish 
evidence of clearer communication with their parents than did the career- 
oriented girls. The post high school plans of senior girls were found by 
Milliken (1961) to be realistically related to scholastic aptitude. 

A novel approach to the study of women’s interests was undertaken 
by Warren (1959), who sought to relate the measured interests of college 
women to occupational role. She found that the Strong Vocational Interest 
Blank did differentiate among the occupational roles held by women. 
More than 60 percent had high interest scores in occupations other than 
housewife and elementary teacher. 

Girls today are faced with conflicting pressures in developing an occupa- 
tionally relevant self-concept. There is also evidence that the majority of 
women will devote an increasingly large share of their adult lives to work 
outside the home. Yet little is known about the vocational development 
of women. The previously described studies contributed needed, albeit 
limited, information. Restricted populations and cross-sectional designs 
are hardly sufficient for the task. It would seem that the time is now 
right for longitudinal developmental studies of girls similar to Super’s 


study of boys. 


Sources of Occupational Information 


field is the development of occupational mate- 
rials that emphasize career development. Most of the materials smack of an 
earlier job choice era. A notable exception is the Widening Occupational 
Roles Kit by Lifton (1962), which seeks to promote the vocational maturity 
of junior high school students. Fictionalized job briefs and filmstrips are 
used to assist the student in occupational planning and decision making. 

Two studies on readability provide little optimism for improvement in 
the reading level of occupational materials. Using both the Flesch and the 
Dale-Chall readability formulas, Watson, Rundquist, and Cottle (1959) 
concluded that little had occurred in the last decade to make occupational 
materials more readable. The reading levels of most materials were in- 
appropriate for junior high school students; the materials on professional 
and semiprofessional occupations were especially difficult. Samples of 
occupational materials studied by Diener and Kaczkowski (1960-61) 
showed a wide range of readability on a given occupation between pub- 
lishers and between occupations within a given publication source. Until 
Occupational materials are written at the lowest. reading level of potential 
users, counselors will have to check for readability. 

In a particularly provocative article, Samler (1961) suggested that 
Printed occupational information was concerned primarily with the “eco- 
nomic man"— with wages, working conditions, and other factors established 


201 


A major challenge to the 


Review or EDUCATIONAL RESEARCH Vol. XXXIII, No. 2 


by job analysis. Consideration of the "psychological man"—the worker's 
role and the way his personality needs will be met—was practically non- 
existent. Support was given for a research program to assess individual 
needs and to discover the need-related characteristics of a given work 
situation. 


Knowledge seems to be at hand to improve occupational materials. Some 
changes are being made. Used as a stimulus for self-exploration, occupa- 
tional materials offer potentialities not yet realized. Research on the 
career-development use of occupational materials could provide the frame- 
work for the development of more appropriate materials. 


Summary 


A new ferment was evident in the area described by the term vocational 
development. There was also a notable lag. Neither materials nor practices 
showed the impact of the newer theoretical formulations. Relatively few 
investigators have done much to change the theoretical thrust. It will take 
counselors as a group to implement the newer conceptions. Only by the 
counselors’ actions will those materials and procedures be developed that 
capitalize on the mounting knowledge of vocational behavior. 
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CHAPTER VIII 


Group Procedures 


E. WAYNE WRIGHT * 


? 


Mas AKIN to the Biblical account of the tower of Babel is the “confu- 
sion of tongues” surrounding group procedures in guidance. In fact, one 
may question whether much change has occurred in this regard since the 
publication of the April 1960 Review, in which Hoyt and Moore pointed 
up the same semantic problems and suggested that, for the defensiveness of 
opposing camps, the proverbial lamb of individual counseling might well 
lie down with the multiple counseling lion. However, a few mediating 
attempts in the literature of the past three years are encouraging. 
Especially pertinent are the contributions of Goldman (1962) and of 
Mahler and Caldwell (1961), who proposed meaningful differences in 
process and in group composition as bases for distinguishing between 
&roup guidance, group counseling, and group therapy. It was Goldman's 
pinion that unsuccesssful experiences with group methods of counseling 
and guidance have been due largely to lack of understanding of the roles 
of content and process in the group activity being undertaken. . 

Wrenn (1962) gave some support to Goldman's premise by suggesting 
that the counselor of the future be aware of the values of various group 
learning experiences and that he define, on a continuum somewhere be- 
tween group guidance in the classroom and group psychotherapy, the 
Particular province of the counselor in utilizing planned group experiences 
as a supplement to individual counseling. Rinn (1961) pointed up the 
need for clarifying the distinctive roles and purposes of different group 
leaders and indicated that instructional versus developmental-centered 
orientations might well represent mutually exclusive determinants of sepa- 
rate group processes. He concluded that the type of process employed 
Would dictate the nature of the group leader's attitudes and his sub- 
Sequent role in the group. 

The topic discussions that follow reflect several other trends noted in 
the literature. The headings encompass those aspects of group work in 
guidance that received attention in the literature during the past three years. 


Group Counseling 


Group counseling received, by far, the greatest attention of all group 
Suidance activities, In the studies reviewed, the use of the term group 


Pronar 
* The author wishes to acknowledge the assistance of Glendon Casto, Karen Brown, and Jean Heaps. 
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counseling exceeded significantly the references to multiple — s 
although the terms were generally considered interchangeable as d B 
tinguishable from group guidance. At the same time, studies by Davis 
(1959) and by Kemp (1962). which reported experimental comparisons 
between group guidance and individual counseling, were not clear in 
specifying the type of group activity under investigation. The reader could 
not be sure, in either study, whether the group guidance process being 
evaluated against individual counseling was really multiple counseling. 
which certainly points up the need for continued efforts toward a resolution 
of the semantic problem mentioned previously. 

The attention given to group counseling was accompanied by greater 
acceptance of this procedure as a valid concept of counseling per se and 
by the recognition that the group process might provide some benefits 
which individual counseling does not. For example, Rossherg and Jaques 
(1961) felt that the one-to-one relationship was artificial and insufficient 
to achieve counseling goals aimed at studying a patient's multiple rela- 
tionships to the various social groups in his life. Evaluation of group 
roles in a rehabilitative hospital setting led Rossberg and Jaques to con- 
clude that the impact of a patiens various group relationships has direct 
significance for his rehabilitative potential. 

A policy statement of the American P 
(1961), which recommended training 
part of a counselor's minimum prepar 


growing awareness of some apparent benefits of group counseling. The 
same position was reiterated more recently by Johnson (1962) and by 
Roeber (1962). Wrenn (1960, 1962) indicated that the counselor should 
sre individual and group processes as having different values and stresse 
the need to distinguish the unique contributions and outcomes of cach 
approach. The following studies reflect some of the more recent expert 
mental efforts in this direction. 

Reporting on research being conducted at the University of Ilinois- 
Ohlsen (1960) and Broedel and others (1960) discussed a study in which 
29 high-ability ninth graders were randomly assigned to two experimenta 
and two control groups for group counseling and no counseling, respe“ 
lively, The experimental groups met twice a week for eight week 
An evaluation of the measures on criteria of academic perbermanctt: self- 
acceptance, and effectiveness of interpersonal relationships indicated sig 
nificant growth in the experimental groups over the control groups 0n al 
least two of the three criteria. Group counseling given to the two contro 
groups after the first experimental period produced similar improvement 


in one of these groups. Follow-up measures 18 months after counseling 
showed that the achieved gains had been maintained 


Attitudinal changes similar to those reported hy Ohlsen were observed 

. A E 7 "s ith 

by Cohn and Sniffen (1962) In a pilot study of group counseling wit 
underachieving and misbehaving seventh grade bovs, An experiment 
group of 8 hovs. selected from a potential sample of 14 on the basis ? 
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ability, willingness to verbalize feelings, and willingness to attend the 
group meetings. received group counseling twice a week for 10 weeks. 
Although unable to support their conclusions by adequate postcounseling 
criterion measures. Cohn and Sniffen felt that the group counseling 
experience had been successful in achieving the intended purposes of their 
pilot project. 

Unfortunately, limitations of time prevented Cohn and Sniffen from 
completing their intended experimental design, and this, coupled with their 
small sample and some apparent bias in selecting the experimental group, 
raises a question about the six boys who were not selected for counseling. 
One wonders what changes might have occurred among this noncounseled 
group during the experimental period and what the relative effectiveness 
of individual and group counseling might be in dealing with this type of 
reticent problem student, for whom the prognosis is probably less favorable. 

Spielberger, Weitz. and Denny (1962) provided partial answers to these 
questions about less-anxious students in an elaborate study designed to 
evaluate the effectiveness of group counseling for improving the academic 
performance of anxious college freshmen. Incorporating a number of 
interesting and well-designed experimental controls into their study, the 
authors demonstrated that the experimental subjects who attended group 
counseling sessions regularly made significant improvement in a 
performance over experimental subjects who did not attend regularly as 
well as over the noncounseled control group. However, noting that only 
one-third of those who volunteered for the counseling attended regularly 


i s i pplicability of 
enough to realize any benefits, the authors questioned the appl cab lity 
4 e approach to academic underachievement 


group counseling would be 


group counseling as a preventiv 
of anxious freshmen. They felt, instead, that : J 
indicated primarily for students with personality traits that would enable 
them to persist in counseling a sufficient number of times to benefit from 
the experienc 
cs a RS to group counseling was presented by Volsky and 
Hewer (1960). Designed for short-term educational-vocational counseling, 
the group procedure consisted, essentially, of a case conference approach. 
in which the educational-vocational plans of each Broup member were 
discussed by the group. The goal of each 50-minute session required that 
some acceptable decision be reached for an appropriate course of action 
to be followed by the individual discussed. Volsky and Hewer indicated 
that, although conclusive findings must await further research. experience 
with over 200 participants had been favorable. "T 
Two studies compared group counseling with individual counseling. 
One, by Baymur and Patterson (1960), which is reviewed in Chapter IX 
9f this publication. will not be discussed here. The other study, by Stockey 
(1961); evaluated the relative effectiveness of individual counseling, group 
counseling, and employment for reducing aggressive behavior among 
adolescent hovs who were experiencing difficulties in adjusting to school. 
reconnseling and postcounseling measures were used to evaluate changes 
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in adjustment as reflected by attitudes toward self. teacher evaluations, and 
academic achievement. The three experimental groups were also compared 
with a control group, with group size ranging from 14 to 16 subjects. 
Significant improvement in adjustment was shown by subjects of both 
counseled groups, but neither the employed group nor the control group 
demonstrated significant change. Group counseling was considered as 
effective as, but not superior to, individual counseling. 

Evaluations of group counseling in conjunction with individual counsel- 
ing were even more sparse than the comparisons between the two ap- 
proaches. While one study by Ford (1962) reported on the use of group 
and individual counseling as complementary processes with the same 
students, the procedures outlined in Ford's study seemed to describe a 
group orientation or group guidance approach when viewed in terms of 
the process levels suggested by Goldman (1962) or in terms of the conno- 
tations of group counseling as found in the other studies reviewed. 

Additional References: Gazda and Ohlsen (1961) : Waldkoetter (1962): 


Applications of Group Dynamics 


The extent to which research in group dynamics should be included in 
this chapter is a moot point, since the October 1950 RrviEw dealt eX 
clusively with that topic. The relations between theory and research i? 
group dynamics and theory and practice in group counseling and guidance. 
likewise, are not clearly defined. Since recent literature indicates a growing 
application of group dynamics principles to educational group processes 
brief reference to a few pertinent sources seems 

Two publications of the National "Training 


1961 a, b) pointed to the necessity for applying studies of group dynamics 
to various psychological, sociological, and educational settings. Bradíor 
(1961a) focused on the general nature of group dynamics and on the us¢ 
of these forces to help groups function most productively. Bradfor 
(1961b) treated group dynamics in terms of teacher-pupil and other class 
room-learning variables. Two additional works, by Lifton (1961) and by 
Glanz (1962), represented significant treatments of group counseling and 
guidance functions in terms of group dynamics principles. 

Pruitt (1960) summarized the research findings relative to group size 
and organization and suggested possible implications for the kinds ° 
groups with which the personnel worker is most generally concerned. 


appropriate. d 
Laboratories (Bradfor®: 


Counseling with Parent Groups 


Recent literature related to group work with parents changed in fot 
rc 


from that reviewed by Hoyt and Moore in the April 1960 Review. Reset 
during the past three years dealt primarily v 


g 
vith programs aimed at helpi”? 
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parents to understand and to utilize test data and other guidance informa- 
tion about their children. In general, reports of such programs indicated 
favorable responses from the parents who participated, but gave little or 
no evidence of beneficial carry-over to the respective students. 

McCauley (1962) outlined a procedure used for reporting the results of 
the school's testing program to parents of fourth graders. Parents who 
attended evening meetings were given the profile sheet of their child's 
lest results. These were explained in a group session. Profiles of under- 
achievers were specially marked so that parents of these children could 
be encouraged to make an appointment for individual consultation. Mc- 
Cauley did not indicate the percent of parents who responded in any given 
year or discuss the objectives or any observed values of the program. 

Herman and Zeigler (1961) conducted group test interpretations for 
parents of potential college freshmen. Parents were asked to rate their 
child on each test area and to predict his first semester grade average. 
Analysis of parent ratings made before and after the test-interpretation 
lectures showed some improvement in accuracy of postlecture evaluations. 
No attempt was made to determine the benefits, if any, that the prospective 
students might have realized as a result of their parents? involvement. The 
reported benefits of the program were noted primarily in the favorable 
opinions expressed by a majority of the participating parents. 


Orientation and Special Guidance Courses 

Orientation activities probably constituted the most videly used group 
Procedure of the typical guidance functions, but evaluative research with 
this procedure was, nevertheless, significantly lacking. Reporting on the 
Increased application of group approaches to student. personnel services 
in higher education, Kirkbride (1961) found orientation activities to be 
among the most relied upon of the group methods reported. Three studies 
illustrate varied applications of the group orientation procedure. 

Duncan (1961) studied the value of group orientation for preparing male 
reshmen to receive individual counseling. An experimental group of 19 
Students received two precounseling orientation sessions, while a control 
group of equal number received only counseling. Postcounseling compari- 
Sons between experimental and control subjects showed no differences in 
terms of self-ratings, realism of vocational goals, and retention of informa- 
tion received in counseling. Although group orientation enabled the coun- 
Selor to reach more students in a similar time period, it did not increase 
the effectiveness of the subsequent counseling. 


Siegel (1960) described a college placement program that utilized a 


half-hour group orientation session for disseminating general information 


to students seeking employment. Since the report did not mention any 
Ə je d H £ 
efforts to evaluate the orientation sessions experimentally, one may assume 


that the sessions were used primarily for purposes of expediency. 
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Equally inconclusive in terms of any measured outcome was a program 
reported by Seymour and Guthrie (1962), in which group orientation 
was used as a preparation for precollege testing and for test-interpretation 
interviews. The program was initiated to handle increasing enrollments 
prior to registration time and to provide practicum subjects for graduate 
trainees in counselor education. The authors were optimistic about the po- 
tential of the program for achieving both objectives. 

Glanz, Hayes, and Penney (1959) questioned the adequacy of college 
orientation programs and proposed a required psychology course as ¢ 
appropriate approach to student personnel goals with entering freshmen. 
Unfortunately. neither type of program received adequate research atten- 
tion in the literature that has been reviewed. The paucity of supportive 
evidence, therefore, leaves the significance of orientation programs and of 
special guidance courses dependent upon more stringent evaluation. 


Occupation Units, Career Conferences, and Vocational 
Guidance in Regular Course Work 


Many group guidance functions traditionally have been carried on a7 
special units of regular subject matter courses, but research reports of 
their efficacy have long been conflicting and inadequate. 

One of the best designed studies on this topic in recent literature was DY 
Rusalem and Darer (1960). The study, which represented. considerably 
more than the typical subject matter unit, was conducted in conjunction 
with a double-period, modified English class over a two-vear period. During 
this time, the experimental classes spent one 80-minute period each week 
discussing topics related to vocational guidance goals. In addition to the 
modified curriculum, the experimental groups received individual and 
group counseling, teacher consultation, and parent counseling. Six class 
groups of tenth graders, with 25 boys and girls in each group, constituted 
the study sample. Three of the groups made up the experimental population. 
and the other three served as controls. Comparisons on several criteria 
showed significant differences between experimental and control group? 
with all gains being in favor of the experimental group. 

Occupation courses and career days were apparently very low on the 
hierarchy of research interests. Although surveys by Sinick and Hoppock 
(1960) and by Carter and Hoppock (1961) indicated that occupatio 
courses were being offered regularly in many states and at a number ? 
universities, there was a dearth of 


: ; : research evaluating this particular 
guidance offering during the past three years, Equally tm were studies 
dealing with the subject of career days. However, Shulim (1959) discusse 


an experiment with career-curriculum conferences at the college level an 
outlined a number of modifications that were dev i 
mortem examinations” 
trials. 
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In view of the many known secondary schools that incorporate annual 
career days and occupation units as regular guidance activities, more 
research information regarding these procedures is greatly needed. Addi- 
tional applications at the college level also warrant further investigation. 
Sinick and Hoppock (1960) encouraged research in the teaching of occu- 
pations and offered to summarize and to publish reports of any such 
research. 

While not reporting on specific research, Caldwell (1960) and Mahler 
and Caldwell (1961) drew upon their experiences with a number of ex- 
perimental programs to help teachers and counselors to implement group 
guidance and group counseling procedures within daily classroom activities. 
Additional References: Brundage and Frank (1961) : Speary (1959). 


Summary and Implications 


The mounting research evidence in support of group counseling cannot 
be ignored. However, the distinctive merits of group counseling as com- 
pared with individual counseling are still inconclusive. While several 
studies outlined some of the benefits that their authors believed group 
counseling could provide, the assumed benefits of the group experience 
Were not often tested as part of the experimental design. What remains 
to be demonstrated more conclusively is the nature of situations and 
conditions for which multiple as well as individual counseling is most 
appropriately indicated. The values of individual and group counseling as 
Concurrent processes also need further investigation, — . 

The influence of group dynamics literature and the importance given to 
the relationship in counseling raises several other questions relative to 
needed research: What kinds of relationships are established in group 
Counseling, and how can these be identified and measured ? How are 
individual and group roles influenced by the relationships established? How 

© various roles affect the outcomes of the group process? Studies by 
Altman and McGinnies (1960), by Cohn, Ohlsen, and Proff (1960), by 
Noble, Ohlsen, and Proff (1961), and by Ohlsen and Oelke (1962) give 
Some direction for seeking research answers to these questions. 

Several studies suggested the need to identify criteria which are more 
Sensitive to charres sosunting among normal populations—changes that 
might be expected after one or two guidance contacts as contrasted with 
Criterion changes anticipated after longer-term, more intensive relation- 
TID. Difficulty in measuring slight and more immediate changes in the 
*havior of normal students may wel! account for the lack of significant 
research with group guidance procedures as well as the descriptive rather 
‘han experimental nature of the available studies. 

, Finally, it seems an unfortunate commentary on the research sophistica- 
tion of school counselors to note that an overwhelming preponderance of 
© research literature in guidance and counseling is still being reported. 
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if not undertaken, by counselor-educators rather than by pupil personnel 
workers. Inasmuch as group guidance procedures have special ae 
for elementary and secondary schools, the importance of more significan 
research from school practitioners cannot be overemphasized. 
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CHAPTER IX 


Program Evaluation 


C. H. PATTERSON * 


"Tuc IMPORTANCE of and the necessity for evaluation of guidance and 
counseling programs and services have been repeatedly stressed. But = 
Rothney and Farwell noted in the April 1960 Review, there is little emr 
dence that much is being done in this area. In the April 1957 REVIEW: 
Cottle also commented on the paucity of research relating to evaluation. 
The present chapter will indicate the continuing scarcity of studies esatto 
ating counseling and guidance services. However, although such studies 
continue to be few in number, the quality seems to be improving. wal? 
relatively more of them being experimental 
survey or simple follow-up studies, 

For the purposes of this chapter, guidance services are defined as perso" 
nel services related to and including counseling, performed by a profes 
sional staff apart from the instructional program but in a school setting: 
The writer is not concerned with other personnel services, such as at 
psychology, or with psychotherapy. Furthermore, since the chapter ! 
limited to studies evaluating the outcomes of serv 


the studies of processes or procedures that are covered in Chapters V 
VIII. 


LH H H T an 
investigations rather th 


: š include 
ices, it does not inch B. 
at 


The literature will be considere 
discussions, follow-up studies, 
lems, and summary and conclu 


i " Trai " oral 
d in the following organization: gener? 


" A rob- 
control group studies, methodological pr? 
sions. 


General Discussions 


Continuing a series of publications, Hill and Nitzschke (1960) presented 
procedures for obtaining evaluations of den™ 
and parents and provided a model study. Hill (1960) furnished a sim 


study of a school's lesting program by its staff. The Oklahoma Stat? 
Department of Education (1961) 


a guidance program from stu ilar 


the 


students. 


" 
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Follow-Up Studies 


Various classifications of evaluative studies have been proposed. For the 
purposes of this chapter. a classification into follow-up or ex post facto 
studies and experimental or controlled studies is adequate. 


Junior and Senior High School Studie3 


"There were few studies of the outcomes of guidance activities by means 
of simple follow-up. Moore (1960) administered a test of guidance 
information to freshmen, and again to the same students as seniors, and 
found a significant gain in knowledge. Students who had more than one 
counselor gained more; and, surprisingly. those who had untrained coun- 
Selors gained more than those who had counselors with some training. 
Gribbons (1960) reported on the use of the manual and guide, You: Today 
and Tomorrow, in nine eighth grade classes taught by seven teachers. Fifty- 
five of the boys and 53 of the girls were interviewed before and after use of 
the material and rated for various factors pertinent to vocational choice. 

Il scores were significantly higher at the second interview. Rubinfeld and 
Hoppock (1961) obtained opinions of a ninth grade course from students 
ìn occupations eight years later: although ratings of the course were not 
highly favorable, 80 percent felt that it should be a required course. Carlin 
(1960) found that only 40 percent of a group of college freshmen had 
had a course on the world of work while in high school, and Jess than 
50 percent of these reported that it had been helpful. 

Additional References: Cohn and Sniffen (1962) : Impellizzeri (1960) ; 


homson (1960); Wrightstone (1960). 


College Studies 


Glazer (1960) obtained questionnaire responses from 118 of 500 former 
Cents of a university counseling center at least 18 months following their 
eXperience; 90 percent declared that they were satisfied or very satisfied. 

ne wonders whether the same proportion of the nonrespondents felt 
the same way. Apostal (1960) obtained occupational data from 114 
Students C rien return) who had had vocational counseling. Three 

f occupational choice and adequacy of the 


Judges rated appropriateness o c T 
Counseling process on the basis of the counseling records. Eighty percent 


of the Occupational choices were rated as appropriate, and the counseling 
Process w 67 percent of the cases. Without a control 
ocational choice obviously cannot be 
s evidence of counseling adequacy. Gonyea (1962) compared pre- 
uni ing and postcounseling vocationa 

versity counseling center with 
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counseling. Nearly one-half changed their vocational plans BRE Jue 
counseling and postcounseling. About 60 percent of these changes wt n "x 
the direction of greater appropriateness, as noted by four judges. One dee 
later, only 29 percent were still following their postcounseling i pium 
However, those students whose postcounseling objectives were rated appr : 
priate had objectives one year later that were more frequently rated - 
appropriate. There was no improvement in grades following counseling: 
either for the total sample or for those whose vocational choices vc 
more appropriate. Gonyea concluded that appropriateness of leo eor 
choice does not appear to be a valid measure of enduring, meaning!" 

effects of vocational counseling. 

Additional Reference: Hackett (1960). 


Control Group Studies 


Junior and Senior High School Studies 


Shimberg and Katz (1962) reported the results of the use of the manual 
and guide, You: Today and Tomorrow, with eighth i 
dents. Scores on a guidance test followir 
room were significantly higher than the 
did not participate in the program 


and ninth grade deni 
ng use of the material in the beni 
scores of an equated group wi 
—a finding indicating that the progra P 
results in an acquisition of information or knowledge related to guider 
concepts. Davis (1959) divided 30 students low in citizenship grades in 4 
three random groups. Group A had twenty 50-minute periods of grou 
guidance; Group B had two periods each of individual counseling + an 
Group C constituted the control group, Counseling was nondirective. Grouk 
A showed significantly greater improvement in citizenship grades than 
Group B or C, although Group B also gained significantly. While the M be 
might appear to show superiority for the group approach, it should : 
noted that while Groups A and B were equated in terms of total course 
time, individuals in Group A had 10 times as much counseling "t 
another study, Davis (1962) compared two groups of 2] students each W 


n 
were potential dropouts. One group was given special attention, W aa 
included special counseling sessions, Members of the experimental Am 
received fewer failing grades. They were referred to the principal less ^ 
quently and for less serious problems, Moreover, there were fewer are 
outs in the experimental group. It j À now 


: S, of course, impossible to k 
which aspects of the treatment the results m 


Baymur and Patterson (19 
eight students in each grou 
sions); one, individual counseling (9-12 ses 
tional counseling session: and one served 
there were no significant 


216 


ay be attributed. evel 
60) compared four groups of underachie 


ges” 
4 85 
P. One sample had group counseling ( yA’ 


: à no 
sions) ; one, a single hove? 
as a control group. A 
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counseled groups combined showed an increase in Q-sort adjustment score 
and in grade average. Broedel and others (1960) reported a study of group 
counseling of gifted underachievers based on Broedel’s thesis, which has 
$ been reviewed by Hoyt and Moore in the April 1960 REVIEW. Following 
16 group counseling sessions, there was increased acceptance of self and 
others, improved scores on an achievement test, and improved ability to 
relate to peers, siblings, and parents; but there was no improvement in 
grades, e 
Arbuckle and Boy 41961) compared three groups of students classified 
as behavior problems. One group received client-centered group counsel- 
ing; the second, after-school detention; and the third, neither experience. 
Although the counseling was not voluntary, it appeared to be effective. 
Additional Reference: Lurie, Goldfein, and Baxt (1960). 


College Studies 


Richardson (1960) found no differences in grade point average, number 
9f dropouts, or number of transfers, in two matched groups of engineering 
students in the lowest one-third of their class. One group received an aver- 
age of 2.5 information- and advice-giving interviews; the other groups, no 
Special attention. Goodstein and Crites (1961) also found no apparent 
effect of two to five educational-vocational counseling interviews upon 
a probationary summer session. Moore 
ànd Popham (1960) compared a group given three student-centered inter- 
Views, a group given three content-centered interviews, and a control group 
&iven no interviews. There were no differences in grade point averages, but 

red interviews gained significantly in 


the group having the student-center ! € idea he heus 
average score on the inventory of academic adjustment given at the begin- 


mng and the end of the semester. ] . 
Williams (1962) provided brief exposure of educational-vocational 
f two interviews and a testing session, to a group 
red them on adjustment measures with (a) a con- 
trol group of applicants who were not given counseling and (b) a control 
8rOup who had not sought counseling. Counseling led to an increase in 
“Sort adjustment scores and in congruence among concepts of self, ideal 
Self, and ordinary person, which persisted four months after counseling. 
Watson (1961) compared equivalent groups of students with personal 
and emotional problems who (a) accepted counseling; (b) appeared for 
ne interview. but denied the need for help: and (c) did not respond to 
the invitation to appear for an interview. After two years, the first group 
Was surerior to the other groups in scholastic achievement and in other 
respects indicated by college records. Searles (1962) compared two groups 
9f superior college freshmen, one of which was given three interviews 
during the semester. in which they were informed of their selection for the 


Study because of their high ability and in which their educational-voca- 
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tional objectives were discussed. There were no differences between = 
groups in grade point average, number completing the semester, number 
placed on academic probation, or number placed on the dean’s list for 
scholarship. Stamatakos and Shaffer (1959) compared four groups of 
superior female freshmen students; each group received different degrees 
of special attention. One of them (an experimental group) received 
monthly structured personal counseling interviews with dormitory counse 
lors. The groups did not differ significantly in grade point average or mM 
participation in activities, although the students in the experimental group 
rated the program highly. 

In a well-designed and well-executed study, Spielberger, Weitz. and 
Denny (1962) found that anxious college freshmen who volunteered for 
and participated in group counseling (between 2 and 11 sessions pe! 
student) improved significantly more in grade point average than did à 
control group who volunteered for but did not receive counseling. 


Methodological Problems 


The recommended procedures for evaluative studies, such as those 


found in Patterson (1962) and in Wellman and Twiford (1961), may be 
outlined as follows: 


1. Select and describe the objectives or outcomes that the program O" 
service should achieve. 


2. Define the criteria that are acceptable as indicators of the achievement 
or lack of achievement of these objectives. 


3. Select or devise instruments to measure the defined criteria. 

4. Design the experiment in terms of the samples and their selection, the 
experimental or statistical controls, the administration or identification of 
the treatment or independent variable, and the application of the criterion 
measures. 


5. Perform a statistical analysis of the data to determine the effectivenes? 
of the treatments in terms of the outcome criteria. 


Difficulties are encountered at several points in attempting to implement 
this procedure. 


Criterion Problem 


The objectives or goals of counseling and guidance services are difficult 
to specify. The problem is not so simple as Tyler (1961) implies in ber 
“question of whether the criterion of counseling success should be satis" 
faction or adjustment" (p. 261). Tn the first place, there are more choices? 
and in the second place, it is questionable that either satisfaction or adjust 
ment is an acceptable goal (Patterson, 1958: Goodstein and Grigg. 1959): 
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The desire for specific and measurable criteria has led to the proliferation 
of easily observable and objective outcomes. These include such factors 
as reduction in dropouts, increase in high school graduates going to 
college, improvement in grades, reduction in underachievement, selection 
of a vocational objective, stability of vocational choice, stability in employ- 
ment, salary level, and salary increase. However, objections may be and 
have been raised to all of these criteria (e. g., Miller, 1961). Not all stu- 
dents should remain in school until graduation; not all graduates should 
Zo to college; not al! students should select a vocational objective before 
high school graduation. Change in vocational choice may be desirable. 
Underachievers may have (a) values and goals other than academic and 
(b) certain talents that should be developed. : . 

Wolle (1960) has made the point that everyone should be aided in 
attaining the highest level he can reach in the area in which he has the 
Srealest talent and interest. In some instances, striving for academic 
achievement or even trying to be a well-rounded individual may be a 
secondary goal. Thus, if a student tends to neglect study of academic ae 
Jects in order to practice the piano and thus becomes an ier ling 
Pianist, he may contribute more to society and to his own me us 
and satisfaction than if he attempts to excel in academic epp. n ke is 
tion, grade point average is an inappropriate criterion or pe. 

irected to achieve better personal p a i personal adjus 
ment may actually suffer following such counseling. 

The d is ihai none of these criteria can be accepted as adequat for 
all individuals, Investigators must move on to the measurement of effects 
relevant to the individual: Indeed, the development of the potentialities of 
the individual through self-actualization seems to be generally accepted as 
the objective of counseling and guidance services, though it n aten lost 
in the desire to use a tangible, objective criterion. reap E his ol jective 
18 the development of an adequate self-concept. This includes p 
and understanding of one's self and attitudes, the ne o mee ^ 
of self-esteem and self-respect, the achievement of eile = e self- 

rection, and the ability to take responsibility for gies eir ag o 

cians and actions. Progress has been made in measuring such fac os 
although the profession does not have adequate instruments, in terms o 
reliability and demonstrated validity. However, it would appear to be more 
Profitable to struggle with the task of converting such € and we 
into measuring instruments than to continue employing oft o ie ut 
Steri]e criteria which are used too often. There is evidence that researchers 
are moving in this direction. Attempts to measure the self-concept by such 
struments as Q-sorts have been reported, for example, in a study by 
heldon and Sorenson (1960). . " 

Miller (1961) suggested a different approach from the generally 
accepted one, in order to avoid the problem of determining or agreeing 
Upon objectives with their value connotations of goodness or badness. In 
this empirical approach, specific guidance procedures or services would be 
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provided, and then differences between groups receiving services and those 
not receiving services would be sought. This, of course, is actually a com- 
monly used approach. Although it might avoid the problem of agreeing 
upon desirable goals, it does not avoid the problem of determining what 
to measure and how to measure it. 


Problems of Experimental Design 


Research in counseling and guidance encounters the basic problems of 
experimentation with human beings. Patterson (1960) continued an earlier 
discussion (Patterson, 1956) of these problems. In addition to the difficul- 
ties posed by sampling, there are the problems of controls. It is not usually 
acceptable to withhold a service from one group in order to have a bas! 
for determining its effects on another group, although it might be argue 
that this deprivation would not be great. However, it may not be necessary 
to withhold services, since other methods, such as delayed treatment. the 
own-control method, and the provision of differential treatment to Con 
parable groups, are useful (Patterson, 1960). However, the problem 9 
equating groups remains. Randomization of sampling or of the application 
of treatments and use of matching techniques pose difficulties (Patterson: 
1956). All relevant variables, such as motivation. changes in family pres 
sures and finances, and other environmental changes, usually cannot 5^ 
controlled or measured. They may operate to negate or to affect differen 
tially the effects of treatments. As a result, it is seldom possible to be sure 
that lack of results really means that the treatment was ineffective, or t! 
results obtained are really due to the treatments. Nevertheless, it is encou" 
aging to see, in the studies reviewed, the increasing concern with attempts t° 
achieve control of unwanted sources of variance, 

A factor that has not been adequately recognized is the nature of the 
treatment variable, that is, the kinds of services provided. One aspect ° 
this problem is the impossibility of being able to determine what aspee 
of a program is responsible for any obtained results when the treatme? 
consists of a complex of services or activities (Patterson, 1955). This con 
founding was present in a number of | al, 1960; 
Davis, 1962; Glazer, 1960; Rubinfeld atako? 
and Shaffer, 1959). 

A second aspect of this problem is present even when only one servio 
such as counseling, is provided. Counseling is not always counseling the" ; 
are differences, not only among individuals but alis; among general E. 
proaches or points of view. One must specify the nature of the ccunseli?? 
involved; few studies do this. Too many studies do not indicate the wa 
ing and experience of the counselors involved. When such informa" 
has been indicated. it appears that most of the research has inv?" 
partially trained and inexperienced counselors. 
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Tyler (19601) stated that no additional investigations are needed in 
which directive and nondirective approaches are compared, since a few 
studies have shown no differences. However, these studies are all question- 
able because of the use of inexperienced counselors and counselors employ- 
ing an approach in which they had no confidence or real understanding. 
Moreover, having the same counselor use both approaches, which osten- 
sibly controls the counselor personality variable, actually introduces error 
related to differences in counselor attitndes, experience, and skill in dif- 
ferent approaches. In most studies one does not know what the treatment 
has been when counseling is said to have been provided. 


Summary and Conclusions 


counseling and guidance services 


Although progress in the evaluation of 
wing the past three years. There 


is present, it has been relatively limited du 4 
has been an increase in controlled studies, but insufficient attention has 
been paid to methodological aspects of such research. Contradictory results 
are prevalent, These are probably related to differences in samples as well 
as to other factors noted in this chapter, such as duration and intensity of 
the services provided, nature or type of the services provided, and experi- 
ence and skill of the counselor. 

As in the past, some studies have 
to be effective. In other studies, the r 


pothesis. Some of the best controlled stu a 
live results. This state of affairs is not necessarily to be taken as support for 


the null hypothesis that counseling and guidance services make no differ- 
ence. Rather it is related to a number of factors that make it impossible to 
compare studies or draw any conclusions from them as a group. These 
factors include the use of different and sometimes inappropriate criteria, 
Some of which are inconsistent with the goals and objectives of education 
and counseling. In some cases, the criteria chosen are unreasonable in 
lerms of what might be expected from the services provided. Berdie (1956) 
las pointed out “expected changes in larger goal-related behaviors may be 
naively optimistic or presumptuous” (p. 346) following short-term coun- 
seling. It is hard to believe that investigators expect results from one or two 
Interviews, Such studies are usually negative (e. 8. Moore and Popham, 
1960; Richardson, 1960; Searles, 1962) except when the criterion is the 
Acquisition of simple information or when vocational counseling of normal 
individuals js involved (Williams, 1962). Some of the best controlled 
Studies that have: reported minimum results (e. g.. Broedel and others, 
1960; Baymur and Patterson, 1960) have suggested that the extent or 
duration of the counseling involved was too limited. 
| $ mentioned previously. another factor contributing to lack of results 
‘as been the use of inexperienced and inadequately trained counselors. 
e finding that such counselors are ineffective has no necessary relation 
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found counseling and guidance services 
esulis did not negate the null hy- 
dies have yielded minimal or nega- 


221 


Review or EDUCATIONAL RESEARCH Vol. XXXIII, No. 2 


7 i ; xperienced and well-trained counsclors are 

to what might be found when experie 

used. a NM ; 
From the analysis of the deficiencies of current. research, a number 0 

suggestions or recommendations for future research are apparent: 


1. Consideration must be given to the goals and objectives of counseling 
and guidance services and to the criteria relevant to the attainment of these 
goals and objectives developed and used in future studies. 

2. Attention must be given to specifying and to defining the nature of 
the treatment variable in order that studies may be replicated and in order 
that one may know to what variables any results may be attributed. Study 
of specific, defined methods or services in terms of specific criteria will lead 
to knowledge of what leads to what and will enable investigators to select 
methods or approaches that will lead to desired criteria or outcomes (e. £« 
Davis, 1959). 

3. An adequate test of the effects of counseling, especially when criteria 
of personality changes are used, must provide counseling services that are 
sufficiently extensive and intensive to provide realistic expectations for 
such changes. It is unreasonable to expect superficial one-interview Coun- 
seling to have such effects. 

4. Any adequate test of the influences or effects of counseling must be 
based upon the use of counselors who are trained and experienced and who 
have confidence in the methods or approaches they use. 

5. Although it is of interest to study the effects of counseling on unmoti* 
vated clients or on clients who do not apply or volunteer for counseling. the 
primary concern is with individuals who are interested in or desirous 
of receiving counseling. Studies using involuntary 
the effects of counseling in a normal cour ‘ " 

6. Long-term follow-up is necessary to ascertain the nature and persi 
tence of effects. In some instances, there are delayed effects; in others, there 


may be superficial effects immediately following counseling which will no! 
persist. i E 


clients are not a test 9 
iseling situation. 


Controlled experiments which meet the 
expensive to conduct, not only in terms 
trols, but also in terms of time, includin 
and the follow-up period. It would app 
search, an adequate study requires mo 
investigator. The time is ripe for an extensive, long-term investigatio 
adequate financial spport, in which exi: imei * 


Š A tO 
sting knowledge may be applied 
the conduct of meaningful res " i y id- 
B 8 earch oa the effi : É nd gui 
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se requirements are difficult and 
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FOREWORD 


The first issue of the Review's first volume was devoted to the curricu- 
lum; the present issue is the eleventh to be so concerned. During the 
32 years covered by this series, the issues not only have mirrored the 
advocated and actual emphases of the American school curriculum but 
also have provided valuable assistance in advancing both the substance and 
methodology of research on the curriculum. The committee which prepared 
this issue sincerely hopes that it may contribute to the distinguished series 
of which it is a part and may serve to further the cause of curriculum 
research and theory construction. 

While this issue continues to give attention to several persisting curricu- 
lum topics, many noticeable changes have been made in its organization 
and in its treatment of research studies selected for rev 
chapters have been included, but, perhaps more import 
of others has been refocused. Such alterations in organization were made 
so that the issue might deal comprehensively with the curriculum field 
and specifically with important elements of curriculum development and 
planning. Conscious efforts were made to minimize overlap between chap- 
ters in this issue and those chapters in previous issues of the REVIEW. 
Where duplication of topics occurs, the concern of chapters in this issue is 
specific to the curriculum and not to the topics in general. Priority for 
review was given to evaluative and comparative research studies whenever 
possible. 

Selection of studies to review in this chapter was, in many cases, not 
an easy task. The most important factor explaining this difficulty was the 
decision to insist that studies be reviewed, not merely listed or annotated. 
To implement this decision, no more than 60 sources were reviewed in 
any one chapter. Most of the research writings selected for review were 


considered more relevant and, hopefully, more significant than others 
related to the topic of a particular chapter. 


The committee which develo 
predecessors in lamenting the 
and rigorously analyzed studies 
dissents from previous estimate: 
situation. The past three years a 


iew. Three new 
ant, the attention 


ped this issue of the Review echoes its 
paucity of well-conceived, well-designed, 
in the field of curriculum. Nevertheless, it 
s about the probable continuation of this 


p 5 appear to this committee to hold the prospect 
of important, if not dramatic, changes in the curriculum research scene. 


Among the reasons for optimism are a growing concern for research and 
study of truly significant problems, a developing Sophistication in research 
design and analysis, and an awareness of the relevance to curriculum 
research of models and of techniques originated in other theoretical and 
applied fields of inquiry. Added to these conditions is the increasing com- 
mitment of money, time, and resources, along with encouragement to 
engage in curriculum research. 
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The chairman wishes to thank the members of the committee and the 
contributors for their competent cooperation in this venture. Beset by 
many problems, individuals conscientiously worked on assignments, at 
times against formidable obstacles and at personal sacrifice. Special ap- 
preciation is accorded the Review editor who contributed so much to 
whatever success this issue may have. 


O. L. Davis, Jr.. Chairman, 
Committee on 
: Curriculum Development and Planning 
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CHAPTER I 


Chart or Charter: Recent Developments in 
Educational Discourse 


DAVID L. ELLIOTT and ARTHUR W. FOSHAY 
2 


Tuc DISCOURSE about educational matters—and especially about curricu- 
lum—continued at an accelerated pace during the past three years. One 
reason for the vast quantity of publications with emotional overtures is 
that education generally, and curriculum particularly, remain uncharted 
fields in which the function of scientific inquiry has been questioned at 
various times and in which the relationship between what ought to be 
obtained and what can be done is essentially unexamined. The statements 
with the greatest public impact are charters, creeds, and manifestoes. 
Meanwhile, the brotherhood represented by readers of the REVIEW con- 


tinues to act as if a rational examination of the hopes and possibilities 
ature could yield hypotheses which, when 


expressed in the professional liter 
tions in or for development 


tested, would yield a realistic basis for modifica 
of theories about the educational process. 

This chapter is based on the assumption that improved theories of educa- 
tion are desirable. It also assumes that many of the abusive criticisms and 
emotionally charged discussions arise because no coherent theory exists. 
If this latter assumption is defensible, then the attempts at theory making 
as well as the renewed criticism of the present state of educational research 


can be seen as having considerable significance. 


Criticisms of Educational Research 


Educational research has been criticized in many ways. Many have criti- 
cized it for the kinds of problems tackled; several, for the procedures used ; 
and some. for the way in which the results are, or are not, employed. 
Research directed at educational practice is suspect for many, not only 
because it involves applying scientific methods to human beings engaged 
in value-directed activities but also because it lacks rigor. Perhaps the most 
common complaint is that research. knowledge in education does not 
cumulate in the manner found in such fields as medicine or engineering. 

A bricf survey of the literature is illustrative. Coladarci (1960), Jensen 
(1962), and Joncich (1962) pointed to a lack of sophistication on the part 
both of researchers and of those who employ their findings. Joined by Belth 
(1962), Goodlad (1962), and Millar (1960), both Carroll (1961) and 
Coladarci (1960) urged that emphases be placed upon pertinent but central 
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research problems of a varied nature. Beauchamp (1961), Coladarci 
(1960), Conant (1961), Maccia (1962b), and Ward (1961) saw a need 
for a firm theoretical grounding for research. 

Generally representative of the trend of criticism are articles by Joncich 
(1962) and Tyler (1962). Joncich (1962) traced the rise of a scientific 
movement in educational research and the subsequent abandonment of the 
idea of a science of education during the first quarter of this century. She 
characterized research efforts sirce the 1930's as being overly analytic and 
fragmentary and exhibiting too little of the purpose, vigor, and “daring 
hypotheses” of the earlier period. Joncich concluded that education could 
and must assume the properties of a rational enterprise that is anchored 
to a clearly comprehensive and objectively derived theoretical framework. 

Tyler (1962) indicated the severe limitations of local evaluations as a 
source of useful generalizations and stressed the need for basic research, 
that is, research which, through developing and testing a theory, could 
serve to explain educational phenomena and to guide the development of 
more effective educational practices and materials. Conant ( 1961) made a 
distinction between trial and error experimentation which builds up prac- 
tical knowledge (e.g., recipes for particular situations) and experimentation 
Me aiii nore wih i ro gredi 
( à l » variety of situations). Upon such generaliza- 
tions, explanations could be based and predictions could be deduced. 

' aes same spirit, Hearn (1958) spoke of social work as a “‘science- 
See gear aah eee em 
an ethene Sgn Joni Y es he social sciences, Characterized 
th y 2 ; by analogy, education—is, however, at least 
partially susceptible to empirical study and theoretical formulation. It is 
not unthinkable that the artistic aspects of teaching. or lead 4 ERA «ht 
be made clearer and more familiar to the public 1 ra E ipn na 
s 5 public by employing the analytic 

and synthetic tools of literature, the fine arts, and aesthetics T a lc 
Bantock (1961) suggested that novelis j à gue deerit mn 


i i ts might help the social scientists 
to identify key components in and il 5 social scientists 
H : he feeling un n" : 
situations. g underlying typical social 


Nature of Research Theory 


Three Stages of Inquiry 


" It seems ser oe ccnfusion exists as to what theory is and 
ow it applies to e ucation. Northro 5 

characterized fruitful inquiry in ny d. ee loce d 
an analysis of the problem in which the basic elements are dizeriet Cb) 
second, a natural history effort embracing observation, descri ding, and 
classification; and (c) third, an attempt at deduciive Fansektion. al a 
theory—a process in which inferences are added to facts in order lo 
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obtain concepts and testable hypotheses. These stages are not necessarily 
apparent in the work of any one research worker, since he may be standing 
on the shoulders of others in his field. Moreover, the order in which the 
three stages are pursued may not be even crucial, provided that each one 
is followed through thoroughly. 

To the extent that the concepts used in research contain inferences, they 
depend for their moaning—and thus, ultimately, for their usefulness in 
publie inquiry—upon the theoretical point of view of those who do the 
inferring. If that point of view is a well-developed, explanatory, predictive 
theory, then the concepts within its domain are defined both denotatively 
and connotatively—both for themselves and for their inner relationships 
with other concepts in the same domain. Such concepts are known by the 
theorists who postulate them. It follows, then, that to borrow a concept from 
one domain to use in another, without bringing its theoretical roots along, 
is to cut it off from much of its meaning and to curb its usefulness. The 
lerms student, story, or teacher may mean the same thing to many people 
In many contexts; the terms learner, print media, and programed instruction 
depend for their meaning upon particular theoretical contexts. 


Analysis of Problems 


estions for improving educational research seem to fit into 
alysis of the problem. Carroll (1961) asked 
ocess with a time dimen- 


Many sugg 
Northrop's first stage of an 
both for a central focus on the school learning pr 
sion added and for close attention to actual behavior and real-life situations. 
He was joined by Millar (1960), who emphasized the subject matter disci- 
plines; by Jensen (1962), who stressed a need for careful study of the 
hereditary, developmental, and environmental factors conducive to learn- 
ing; and by Ausubel, who suggested in the December 1961 Review that 
another careful look at verbal reception learning be taken in the course of 
redoubled efforts to try to locate what it is that ma 


ingful. Also related were articles by Goodlad (1962) on subject matter or- 
provide for a range of student responses, 


gested a focus on different kinds of teach. 


kes any learning mean- 


Sanizing centers, which might 

and by Bantock (1961), who sug 

ng in varying contexts. : -— 

Educational philosophers. like Northrop, play an important part in this 

analytic stage. Conant (1961), Hardie (1962), and Maccia (1962c) join 

Northrop in urging the necessity of making distinctions between normative 
tual problems. 


problems involving values and fac 
Hardie (1962), Nagel (1960), and Scheffler (1960) analyzed selected 
educational terminology relative to contextual and logical considerations. 


Specificaily, both Nagel (1960) and Hardie (1962) called into question 


Some assumptions underlying measurement. . 
Page (1962) found a "verbal magic" rather than careful attention to the 


realities of school situations present in many of the attempts to relate 
behavioral science findings to educational practice. Conclusions drawn 
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from experiments with rats, for example, are often applied directly to work 
with school students without the necessary mediating inquiry. To remedy 
this, Page suggested three courses of action: (a) analyzing the physical 
realities of experimental situations, (b) harnessing excellent design and 
measurement skills to education-centered problems, and (c) expressing 
these problems in language shaped for education which is “intersubjectively 
testable and scientifically respectable.” 

Finally, Smith (1961) employed philosophical and analytical techniques 
to the concept of teaching and to its relationship to learning. He concluded 
that teaching should be examined as a discrete entity which, in being inde- 
pendent both of concepts of process and of specific conditions, would con- 
note its own forms, constituent elements, and regularities. Upon this conclu- 
sion, Smith and others have based large-scale research projects. 


Natural History Stage 


The movement from an immediately perceived to a described fact in 
Northrop’s natural history stage is illustrated in a project reported by 
Hemphill, Griffiths, and Frederiksen (1962). In a phase of their work en- 
tailing the building of concepts for use in research on educational adminis- 
tration, they proceeded in three steps. First, they drew upon material that 
described the job of an elementary principal. The summary of this material 
resulted in a taxonomic grid with 12 cells containing four areas of adminis- 
trative activity taking place on three levels. Second, incidents were created 
that might occur in any one of the boxes in the grid, that is, examples of 
activity in each area and at each level of administrative activity. From 
these examples, realistic facsimiles of problems with which principals 
might actually have to deal were devised. These items were presented to 
principals in simulated situations which, among other procedures, employed 
“in basket” tests and kinescopes of school situations. Finally, the data col- 
lected were scored in several ways and factor-analyzed. The factor analysis 
yielded eight basic factors or dimensions of administrative performance. 
It was pointed out that the categories had practically no relationship in 
language to the categories of the original taxonomy. This was a sy& 
tematic analysis of a process or function that went from common sense 
terminology to concepts intended to be more useful in describing the 
phenomena of educational administration, 

Several taxonomic schemes appear in the literature. These are variously 
called conceptual frameworks, conceptual systems, and models. (The term 
model is also used in certain special ways, which will be discussed later.) 
Since a taxonomy is not a theory but an arrangement of the basic elements, 
it represents essentially Northrop's natural history stage, which may lea 
to a theory. It should be emphasized that the chief shortcoming of the 
various curriculum frameworks arises from the difficulty that they are not 
related to inferences, hypotheses, or action. As curriculum guides, such 
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frameworks cannot function. They are pretheoretical and, therefore, two 
great steps away from action. 

Downey (1960) presented a “model for the improvement of secondary 
education,” which he had designed (a) to systemize existing knowledge 
about that field into three dimensions of the educational enterprise— 
substantive, procedural, and environmental; (b) to point up the need for 
further research and knowledge: and (c) to provide a plan for the im- 
provement of practice, Millar (1960) suggested that a conceptual system, 
or framework, serves an educational researcher by aiding in the identifica- 
lion of important problems. An adequate conceptual system should (a) 
aid in the finding of necessary relationships between various problems in 


order that the organization of the framework may be intrinsic to the prob- 
lems involved; (b) help suggest data sources; (c) serve not only to identify 
ave to be made but also to 


points relative to which curriculum decisions h 
designate resources by which such decisions may be made and imple- 
eria by which existing research results may 


ate framework is self- 
dge and changing 


mented; and (d) supply crit 
be evaluated. In addition to all this, an adequ 
correcting and, thus, modifiable in the light of new knowle 


conditions. 
X. : - 
Beauchamp (1961) saw what he called a “curriculum theory model 


as being one step beyond taxonomy on the way to construction of curricu- 
lum theory and subtheories. He stated that such a model contains assump- 
tions and. postulates, both normative and descriptive, that are basic to 
curriculum. and curriculum. development. definitions of technical terms, 
statements about ways to involve people in curriculum planning, and pre- 
scriptions for certain curriculum design features. The components of such 
a model are chosen on the basis of a classification system or taxonomy of 
the elements contained in curriculum development work. The specific factors 
included in the model determine its nature and the nature of the theory 
produced. The model is used as a guide to curriculum choices. When such 
Choices are expressed. Beauchamp believes that a curriculum theory has 


been formulated. 


Theory for Education 


"ns 
Two Usages of the Term 


been employed in the literature in two main senses, 
emphasis of this chapter, a definite distinction 
first usage which falls within the 
as described by Clayton in the 
is exemplified by an entire issue 
dealing with certain philosophical 
Whitehead and with their applica- 
ducation may include state- 
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The term theory has 

and, in keeping with the 

! must be made between the two. The 
omain of the philosophy of education 

February 1961 Review or Bayles (1962) 

of Educational Theory (1961)—an issue 

Statements or syntheses by Alfred North 

tion to educational practice. Philosophies of e 
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ments about the nature of man, society, and knowledge as well as normative 
statements concerning the proper aims and goals of schooling. Such philoso- 
phies are not necessarily consistent with the descriptive findings and theo- 
ries of the behavioral sciences. Moreover, deductions directly applicable to 
actual school practice frequently cannot he made from statements of aims. 

The second usage is illustrated by a group of articles in Educational 
Theory by Ausubel (1961) on verbal learning, by Page (1962) on experi- 
mental models, by Belth (1962)' on the prospects for a discipline of edu- 
cation, and by Clements (1962) on the use of the term theory in education. 


Theory in Relation to Curriculum Development 


During the past three years, several writers have stressed the importance 
of theory as the basis for carrying on educational research. Beauchamp 
(1961) presented perhaps the most ambitious treatment of theory relevant 
to curriculum development. In his book, he reviewed a number of state- 
ments about theory in the behavioral sciences and then proposed his own 
definition of curriculum theory. According to Beauchamp, theoretical work 
in curriculum is concerned with definition of technical terms, classification 
of existing knowledge, formulation of inferences and predictions, design 
of curriculum theory models to guide theory construction, and development 
of subtheories, since the immensity of the curriculum field prevents its being 
investigated as a whole. This subtheory category includes the interpreting 
of theories from other disciplines such as psychology for curriculum, rais- 
ing questions of curriculum design, seeking improved answers, and devising 
theory for curriculum design and other aspects of curriculum. Beauchamp 
concluded by offering a curriculum theory model of his own, both as 
an example of what he had described and as a start for further work by 


others. In the terms used here, Beauchamp's model, which is still a 
framework, also includes normative assumptions 


Theories in Educational Research 


Tyler (1960) characterized behavioral science theories 
tions revealing relationships, structures, and processes which are embedded 
in complex situations. The most useful of these theories not only guides an 
observer in grasping more readily the events in a classroom, for example, 
but also permits explanation of such, ey 


T i ents. In a later article, Tyler (1962) 
noted that theories could function in Wo mam ; 


search: (a) as bases for explanation of educational phenomena; and (b) 
as sensible guides for the invention of procedures and materials for educa- 


tional practice. The theory aids one in deciding what to look for and how 
to look for it. 


as conceptualiza- 


ways in educational re- 


Maccia (1962b) described a theory as a representative model of the 
phenomena in a particular domain and as a set of statements at the leve! 
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of conjectures that characterize the objects in the domain. Maccia and 

5 Maccia (1962) indicated that an adequate theory and characterization of 
aspects of the educative process are needed. 

Other writers seem to be in general agreement with the preceding state- 

ments on the function of theory for educational research. However, there 

j seems to be a great deal of disagreement and confusion concerning (a) 

what aspects of tke educational enterprise are accessible to theoretical 

formulations of the behavioral science type and (b) how one should go 


about obtaining or constructing a theory. The impression that one gains 
ry different from that which one obtains 


from Beauchamp's suggestions is ver 
(1962), Maccia (1962a), and Griffiths 


from the examples given by Tyler 
11959, 1963). 


Theory Defined 


and development of educational research, it 
arrow definition of descriptive 
ative theory, Griffiths (1959) 


At this point in the history 
seems well to start with a rigorous and n 
theory. In his helpful discussion of administr 
accepted a restricted definition by viewing a theory as furnishing a number 
of assumptions from which a set of unifying empirical laws may be derived 
by logical procedures. Similarly, Hearn (1958) proposed that a theory 
consists of a number of verified hypotheses that are internally consistent. 

Internal consistency is what makes the separate propositions & set. The 
definition of a theory can be tightened further by suggesting that (a) no 
one statement in the set should be derivable from any other statement or 
combination of statements in the set, and (b) the total set of laws cannot be 
derived without the inclusion of every member of the set. A theory is tested 
through the hypotheses or laws that are deduced from it. Thus, one crite- 
rion of adequate descriptive theory is that its laws are logically, or in prin- 
ciple, verifiable even though the necessary instruments or techniques may 


not have been invented yet. ivi Nr 
Fro ^& (1960) formulation, one cou say s É 
Vin ce o select or abstract from complex 


cation would help educational rescarchers to se^ 
situations certain variables that are systematically related to each other. 
Each set of related v s. constitutes a system; and the 


ariables, or concepts, con d t 
theory provides, in Griffiths’ (1959) formulation, not only a description 
of data sets in terms of symbols but also t 


he means for performance of 
Operations upon the symbols in lire with rules underlying the system of 
which the symbols are a part. The rules of a theoretical sy 


stem are its laws 
and the assumptions that explain the laws. The concepts postulated and 
defined by the theory are the variabl 


es contained in the system. These 
variables behave according to the rules or laws or, if not verified, according 


to the hypotheses proposed. Laws and hypotheses are “if-then” statements 
about the regularities of the interrelationships of the variables. To the extent 
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that the concepts employed in a research design are part of the same theo- 
retical system, they can be fruitfully manipulated in varying relationships 
to each other, and the conclusions or results will be generalizable to other 
situations in which the same or analogous systems of concepts or variables 
are to be found. 

Through covering only a limited domain, a theory helps the rescarcher to 
delimit the scope of his research effort. The limited scone in turn is a sub- 
division of the educational enterpris 


such as teaching or part of teaching. 
leadership, learning, or group behavior. It scems best, therefore, first to 
seek theories for education or curriculum— distinct theories of distinct 
aspects of the educational enterprise. Later, a comprehensive descriptive 
theory of teaching or a comprehensive descriptive-normative curriculum 
theory or philosophy may follow. 


Theory Making in Education 


In the literature of the past several years, two main approaches to the 
building of behavioral science theory for education at a descriptive level 
were proposed. The first is inductive, proceeding from observable data 
and working up as far as possible from hypotheses to laws and then to 
explanatory assumptions. The second is deductive, or, in Maccia's (1962a, 
b) terminology, retroductive, in that it involves borrowing theoretical 
formulations from other analogous areas of inquiry, substituting concepts. 
deducing hypotheses and laws, and testing the results against ‘observable 
data. 

The inductive approach to theory building, which is 
ously mentioned stages of inquiry suggested by N 
related to a set of rules for theory building worked 
as well as to the content of aforementioned writings of Beauchamp (1961), 
Hearn (1958), and Hemphill, Griffiths, and Frederiksen (1962). 

A similar approach was used in the study of intellectual functioning by 
Jean Piaget, as described by Hunt (1961). Piaget observed the behavior 
of children at different ages when they were presented with varying kinds 
of intellectual tasks. He made theoretical inferences about the presente and 


development of certain intellectual structures which might account for what 
was observed. 
The deductive approach was most com 


^ 1 y pletely stated in the writings of 
Griffiths (1959, 1963), Maccia (19622, c), [OUR and Macc (1962). and 
Tyler (1960). ee 


found in the previ- 
orthrop (1959), can be 
out by Homans (1950) 


Models 


Griffiths (1963) proposed an approach to theory making through the use 
of models. As did Tyler (1960), Griffiths suggested the use of the term 
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model to denote a theoretical formulation from an area about which a great 
deal is known. The available information is used to suggest laws, or hy- 
potheses, for an area about which little is known. Both pointed to the 
futility of using formulations from areas about which little is known to 
study other areas about which little is known. Both Griffiths (1963) and 
Maccia (1962b) illustrated this operation of theory development through 
an example taken from Brodbeck (1959) concerning the problem of 
whether the laws of epidemiology could»be used as a model for a theory 
of rumor transmission. In Brodbeck's example, the descriptive concepts 
of the laws of epidemiology were first replaced by letter variables to reveal 
the form of the laws. The concepts referring to diseases were then put into 
à one-to-one correspondence with those referring to rumors, and the letter 
variables of the epidemiology laws were replaced by the descriptive terms 
referring to rumors. This resulted in a set of hypotheses about rumors, 
which may or may not be confirmed. f 
Griffiths went on to identify some of the assumptions that underlie the 
use of models. For the study “of educational administration, Griffiths sug- 
gested that an open systems model could be applied to the dimensions x 
administrative performance identified through factor analysis, as n je 
study reported by Hemphill, Griffiths, and Frederiksen ird Fd 
assumptions about educational administration that would be involved in 
making this application are that (a) administrative je ME e 
eralized type of behavior found in all human organizations, e: (b) ad- 
ministrative performance involves directing and controlling behavior in a 


Social enterprise. PN ae 
In applying a model to a domain under study, one looks for isomorphism, 
‘ ationship between the elements of the 


that is, both for a one-to-one rel k s 
i i kind of relationship 
m f the domain and for the same 
einige elemento of the model and the elements of the 


to one another among the elements 

Ed 1 ri 
domain. Extended to theory, this statement can be interpreted to mean that, 
if correspondence in the form of two sets of laws in two theories exists, 


one theory is a model for the other. The propositions ie A 
model theory must be demonstrated as applicable to the aspect of education 
under : d 

In bos term retroduction for the process bs eu ligne ed 
furnish a basis or model from which another theory = e set pe 
Maccia (1962b, c) seemed to differ with Griffiths on the p whic 
isomorphism or analogy is desirable. Following an analysis pe iscussion 
of induction and deduction, she said that, since nether in ER a 
deduction covers the process involved in devising yea nx E a o 
not completely describe scientific inquiry. Maccia pum n conc E s 
three principal stages of scientific inquiry exist: (a) ges ee h 
which one may set up characterizations ; (b) Nego PERN T ich one 
can explicate characterizations; and (c) induction, through which one may 
evaluate the characterizations. 
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Incomplete Analogy 


An important aim in education is to devise new theory, that is, to 
formulate characterizations not already possessed about the domain or 
aspect of education under study. If Maccia’s main point has been inter- 
preted correctly, she differs with Griffiths in saying that in order for new 
theory or fresh characterizations to be fruitful, the. components of the 
theory used as a devising model-cannot be completely analogous to com- 
ponents of the areas under study. If they are strictly analogous, then the 
devising model may be already an adequate theory for the aspect of educa- 
tion under study. If the model is not properly called a theory, the terms 
framework and conceptual scheme may be preferable. 

Maccia (1962a, b) made her point about the nature of devising models 
by classifying models into several distinct categories. The two major cate- 
gories are the representational model, or a model of a particular set of phe- 
nomena or domain, and the nonrepresentational model, or a model for a 
particular set of phenomena. A theory that operates for a domain is a 
representational model, as in Tyler’s description of theory as a simplified 
picture that characterizes a particular set of phenomena and their inter- 
relationships. Although frameworks are also representational models, they 
differ from theory in other respects. 

In a survey of recent literature dealing with the use of models for the 
study of various aspects of education, Maccia (1962c) classified models 
as either representational, thus promising for explication and evaluating 
theory, or nonrepresentational, hence promising for devising theory. She 
identified those of the 37 models included in the survey that seemed to 
fulfill the criteria for theory-devising models. Among those studies identi- 
fied as suggesting models of the nonrepresentational, characterization type 
were the following: (a) Ausubel's exposition of a programmatic model for 
systematically ordering relationships, offered as a plan in the December 
1961 issue of the Review; (b) Carroll's (1959) formulation of a model 
for language behavior based on computer logic; (c) Griffiths’ (1963) 
presentation of a characterization theory which was used to devise another 
characterization theory; (d) Kjeldergaard's description, in the April 1961 
Review, of a theory of mediated generalization, in which a computer pro- 
gram was used to devise a theory of the acquisition of language; and (e) 
Sweitzer's (1963) proposal that two theories can be used not only to 
explicate and perhaps to evaluate each other but also to devise further 
propositions not contained in either theory. 

Finally, one can conclude that there is a third major source of the 
descriptive type of theory to serve and direct educational research. The 
first two sources involve the analysis and description of peculiarly educa- 
tional phenomena or patterns of phenomena. The third entails the employ- 
ment of theories of human behavior from the several behavioral sciences 
in the manner illustrated by Tyler (1960). The aspects of human behavior 
encompassed by theories in the behavioral sciences are common to all 
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human individuals or groups of individuals. Although the educational 
researchers and practitioners may make particular applications of such 
theories to the educational domain, there is assumed to be no exclusively 
educational theory underlying the behavior in question. By way of example, 
this idea seems inherent in Griffiths’ (1963) discussion of the application 
of systems theory to educational administration. 

Whereas a systems theory of administration could be employed to study 
the administrative function in any organization and a social psychology 
theory of group dynamics could be applied in the study of many different 
kinds of groups, there are areas in the educational enterprise, e.g., aspects 
of teaching or being a student, for which no theory from any other 
Thus, it seems important to bring to bear 
Northrop's three stages of inquiry on these two areas for which no adequate 
theory has been found or devised. One needs also to follow up Maccia's 
conception of retroduction through nonrepresentational models—an im- 
portant development in theory construction which has emerged from the 
Center for the Development of Theory in Education at Ohio State 
University. The use of a point of view from within a theory in one field 
as a path leading to fresh characterizations or theoretical pies Mire 
another field has apparently been the key to progress in much mie s 
inquiry in the past; the concept nonrepresentational may be a usefu a 
terion of selection in one's search for theories to use mm devising models 


for educational theory. 


discipline exists per se. 


Bibliography 

AusunkL, Davin P. “In Defense of Verbal Learning.” Educational Theory 11: 15-25; 
ey C. “Educational Research: A Criticism.” Harvard Educational Review 
pe Eee unen ictuforad Imperatives." Educational Theory 12: 129-39; 
SE Grorce A. Curriculum Theory. Wilmette, 
ELTH, Marc. “Prospects for a Discipline of Education. 
pone october ah, Meaning, and Theories." Symposium on Souls EA 
(Edited by Llewellyn Gross.) Evanston, Ill.: Row, Peterson and Co., p 

rebel a nin Operational Model for Language Behavior.” Anthropological 
wt pale esas Janay in Educational Research.” Phi Delta Kappan 42: 


339-43; May 1961. ; 
REET, TERA “Theory and Education. 
62. 


Ill: Kagg Press, 1961. 149 pp. 
” Educational Theory 12: 193- 


” Educational Theory 12: 124-28; April 


F » 

CorapAncr, Artur. "Towards More Rigorcus Educational Research." Harvard Edu- 

duum Review 30: $1 T tau Improvement of EGUNAO Washington, 

D QU es AE iio and Curriculum Development, a lepartment of the 
atio i jati 1. 24 pp. 

National Education Association, Pa ee AM 


del for Improvement.” School 
Downey, Lawrence W. “Secondary i 


i à ë 60. ^ » 
Ee Dio Aatun aa Issue: Alfred North Whitehead, 1861-1947.” Edu- 
cational Theory 11: 193-280; October 1961. a 


o5 oo l*l umm 


Review or EDUCATIONAL RESEARCH I ol. XXXIII, No 


GoopLap, Joun I. “The Organizing Center in Curriculum Theory and Practice. 
Theory. into Practice 1: 215-21; October 1962. 3 SPPN. 
GrirfITHS, DANIEL E. Administrative Theory. New York: Bureau of Publications, 
Teachers College, Columbia University, 1959. 123 pp. à ; aa 
GiurrrrHs, Daniet E. “Some Assumptions Underlying the Use of Models in Reseaneh. 
Educational Research: New Perspectives. (Edited by Jack A. Culbertson anc 
Stephen P. Hencley.) Danville, Ill.: Interstate Printers and Publishers, 1963. 
Chapter 9, pp. 121-39. is. 
Harpie, Cuanies D. Truth and Fallacy in Educational Theory. American edition. 
New York: Bureau of Publications, Teachers College, Columbia University, 1962. 
156 pp. e & 
Hearn, Gorvon. Theory Building in Social Work. Toronto: University of Toronto 
Press, 1958. 83 pp. ake 
HEMPuILL, Jons K.; Gnirrrris, DANIEL E.; and Frevertksen, Norman. Administra- 
tive Performance and Personality: A Study of the Principal in a Simulated Elemen- 
tary School. New York: Bureau of Publications, Teachers College, Columbia Uni- 
versity, 1962. 432 pp. 
Homans, Georce C. The Human Gr 
484 pp. 
Hunt, Josep McVicker. Intelligence and Experience. New York: Ronald Press, 
1961. 416 pp. 


JENSEN, ArTHUR R. “The Improvement of Ed 


oup. New York: Harcourt, Brace and Co., 1950. 


ucational Research.” Teachers College 
Record 64: 20-27; October 1962, 
Joncici, Geratpine. “Wither Thou, Educational Scientist?” Teachers College Record 
64: 1-12; October 1962. 


Maccia, ELIZABETH STEINER. The Conceptions of Mode 
Occasional Paper 62-114. Columbus: Bureau of Educational Research and Service, 
Ohio State University, 1962. 53 pp. (Mimeo.) (a) 

Maccia, Exizasern Steiner. Models and the Meaning of “Retroduction.” Publication 
62-110. Columbus: Bureau of Educational Research and Service, Ohio State Uni- 
versity, 1962. 16 pp. (Mimeo.) (b) : 

Maccia, ELIZABETH STEI Ways of Inquiring. Publication 62-107. Columbus: 
Bureau of Educational Research and Service, Ohio State University, 1962. 13 pP- 
Mimen.) (c) Ld 

Maccta, ELIZABETH STEINER, and Maccta, Gronce S. The Way of Educati Theoriz- 
ing Through Models. Publication 62-111. Columbus: usd ee 
and EUN Ohio State University, 1962, 16 pp. (Mimeo.) 

MirLAn, MICHAEL. “ Subje Nen au TRI. evt Qu 

N Lam. d e as de te dl Subject Matter in Curric ulum Inquiry. School 

NAGEL, ERNEST. “Philosophy in Educa 
Educational Research. (Edited 
Delta Kappa (Eighth St. and U 

Norturop, F. S. C. The Logic 
Meridian Books, 1959. 402 pp. 

D oF i ^ + 

SCHEFFLER, ISRAEL. The Language of E 
1960. 113 pp. 

Suri, B. OTHANEL. “A Concep| 
(Edited by B. Othanel Smith 
1961. Chapter fo pe spao. 

Sweitzer, Ropert E. “An Assessment of Selecte i ini ion." Educa- 
tional Research: New Perspectives. (Edited D ba Sef du 
Heng ey) Damala Ill.: Interstate Printers and Publishers, 1963. Chapter 13, 
pp. 199-231. 

T PR W. Bs, a emrin of the Behavioral Sciences to Educational 
esearch.” First Annual Symposium on Eq i (Edited nk 
W. Banghart.) Bloomington, Ind.: Phi Del a Kaona! Research. (Edited by Hr. 


l in Educational Theorizing: 


tional Research." First Annual Symposium. 0n 
by Frank W. Banghart.) Bloomington, Ind.: Phi 
nion Ave.), 1960, Chapter 5, pp. 71-84. 

of the Sciences and the Humanities, New York: 


ducation. Springfield, Ill.: Charles C. Thomas, 


t of Teaching." La. 


nguage and C. in Education. 
and Robert H. Ennis, Cie Ean 


nis.) Chicago: Rand McNally & Co» 


ta K: (Eighth St. ion Ave.); 
1960. Chapter 4, pp. 5870 ; appa (eighth St. and Union 
Tyer, Rapu W. “Specific Contributions of Research to Education.” h into 
Practice 1: 75-80; April 1962. 9 Education.” Theory 


Warp, Vircit S. "The Function of Theory in Progra: for the Gifted.” Teachers 
College Record 62: 532-39; April 1961. px EO ed 


944 


CHAPTER II 


Organized Knowledge Influencing Curriculum 
Decisions 


0. L. DAVIS, JB, 


S EnmbUS examination of the nature of knowledge and of its proper rela- 


ti i i : s 
onship to the school curriculum has been evident during the past three 
and ancillary activities has emerged the 
vledge is a major source 


al disciplines and of the 


years. From the resulting analyses 
rapidly developing recognition that organized knov 
of the curriculum. Thus, the nature of the intellectu 
practical technologies demands as careful and systematic attention in 
bie. development and planning as the nature of society and the 
carner reccived in the past. This response undoubtedly was influenced by 
multiple pressures, including (a) the recognition that the disciplines are 
instructive in addition to preservative in nature, (b) the renewal of interest 


and active participation in curriculum planning by academic scholars, 
roughout American society for 


(c) the expression of general concern thr 
the determination to deal effec- 


Increased excellence in schooling, and (d) 
tively with the modern explosion of knowledge. 

Developments of the recent past and prospects for the future, rather than 
Precedent or a fund of research evidence, justify the inclusion in this 
issue of the Review of a chapter dealing with organized knowledge and 
the curriculum. In this chapter, some of the recent analytical statements 
and descriptive reports treating the nature of knowledge and the curriculum 
are reviewed. A major purpose is to highlight those statements and reports 
felt to be particularly productive in terms of future development and 
research, Not attempted here is a comprehensive survey of attempts to 
analyze most disciplines for their curriculum implications, a historical 
treatment of the role of organized knowledge in past curriculum-making 
ormulations and efforts. or an analysis of the conceptions of knowledge 
revealed in recently inaugurated curriculum development projects. 


The Disciplines: A Curriculum Source Recognized 
(1960 and influential book, The Process of Education, Bruner 
e ! emphasized the importance of the structure of knowledge in curricu- 
3 aes anning and teaching. He pointed out that to know the structure of 
wheres is to know its patterns and concepts. To deal with the funda- 
ines structure of a field, he held. will make the subject understandable. 
ate transfer and retention of that which is learned, and reduce the 
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lag between knowledge dealt with in school and that produced by Mi ine 
Bruner gave special attention to the importance of curriculum prov is i 
for both intuitive and analytic thinking. His emphasis on the desirability a 
fostering intuition was consistent with his provocative position on the ind 
ture of knowledge. Bruner's (1962) more recent hook, On Knowing. pre- 
sented elaborations of his basic positions regarding structure of knowledge 
and thinking. The “lucky guessing" which he advocated should not be con- 
fused with intellectual anarchy: Discovery, which is based on one’s having 
prior knowledge with which to think, consists of reordering data in Ma 
that permit one to go beyond the data to new understanding. Bruner dic 
not see his papers as establishing the basic criteria for curriculum selection 
and organization. He presented, in a popular and engaging manner, in- 
formed and competent judgments on topics necessarily important in cur- 
riculum planning. Since he did not prescribe comprehensive solutions, 
curriculum workers and researchers must specify and evaluate applications 
derived from his general propositions. 

Foshay's (1961) address to the conference of the Association for Super- 
vision and Curriculum Development claimed for organized knowledge— 
the disciplines and technologies—overdue recognition as a basic curriculum 
source. Each discipline, he asserted. is characterized by three elements: it 
domain, its way of knowing or method of inquiry, and its tradition. To 
approach knowledge indirectly, as he believed curriculum making has done 
in the past, is to take knowledge for granted. His modest proposal was 2 
direct confrontation in the schools of the essence of the disciplines. In 
addition to his illuminating examples of direct approaches to the disci 
plines, Foshay discussed educators’ roles in translating the disciplines into 
subject matter appropriate for learning by school children. An indispensa- 
ble ingredient in this translation task was seen as the educators’ insisting 


that scholars effectively communicate their conceptions of their disciplines: 
For such communication to take 


Place, Foshay urged curriculum workers 
to ask the right questions, to listen attentively, and then to develop programs 
using the insights thus gained in addition to understandings of society an 
learners. Foshay also insisted that direct attention to the disciplines must 
not obscure or replace concern for important, complex problems to which 
the contributions of any single discipline would be inadequate. . 

Phenix (1962) stated that only knowledge drawn from the disciplines 1 
appropriate to the curriculum, His Cogent argument posited that knowledge 
in a discipline is instructive and, while knowledge outside the disciplines 
exists, it is noninstructive. Phenix indicated that disciplined knowledge 1° 
characterized by analytic simplification, synt 
mism. Also provocative was his contention that the disparity between the 
logic of the disciplines and the ic of instruction is imagined a” 
destructive to both scholarship and pedagogy. Phenix also called for 
acceptance of a realistic rather than a nominalistic view of knowledge f°" 
curriculum planning. 
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The Disciplines: Analyses and Implications 


: Two types of positive responses have followed the call for greater atten- 
tion to the disciplines in curriculum development. Several important 
attempts have been made to analyze particular disciplines and their struc- 
tures. The significance of these efforts lies not in that they were made, 
for some scholars throughout time have studied and discussed the nature 
of the disciplines. Rather, their importance is derived from the fact that 
they were made in the hope that clearer, more accurate, and more coherent 
curriculums might be constructed for elementary and secondary schools. 
Another response has been the formulation of proposals to take into 
account the various disciplines within a general curriculum design. 

A major contribution to the necessary understanding of the disciplines 
was a seminar convened in 1961 by the National Education Association, 
Project on Instruction. At this meeting, scholars discussed with educators 
fundamental concepts and inquiry modes from their disciplines. The 
seminar report (National Education Association, Project on Instruction, 
1962) summarized the basic ideas treated and suggestions advanced regard- 
ing the relationship of the disciplines and the school curriculum. One 
interesting proposal was a curriculum with “nuclear” and “cortical” com- 
ponents. Without some such curriculum arrangement, some participants 
foretold either an internecine power struggle between proponents of indi- 
vidual disciplines or a possible homogenizing synthesis of the disciplines 
by educators. Of importance in future seminars, perhaps more than the 
specific ideas discussed, will be not only the participants’ exhibition of 
genuine interest and concern for the disciplines and for the curriculum 
but also the stimulation of a continued and deepened commitment by 
scholars and educators to explore curriculum problems together. Also, 
forthcoming project reports may be expected to display the influence of 
the disciplines as a curriculum source. 

While Bruner and Foshay emphasized the importance of the concept of 
the structure of knowledge for both teaching and learning, neither described 
the structure of any discipline in detail. Indeed, one of the most penetrating 
criticisms of their proposals has been the charge of the possible extrava- 
gance of their claims. Disagreement is profound on the degree of precise- 
ness with which disciplines may be differentiated. While admittedly insuf- 
ficient, recent efforts to describe and analyze the structures of disciplines 
indicated that this task is necessary not only to an understanding of the 
disciplines but also to the advancement of the cause of using the disciplines 


directly in the school curriculum. 

Drawing on examples from the sciences, Schwab (1962) noted that a 
structure is comprised of those concepts that define the domain of the 
discipline and determine its inquiries. Since attention to structure is atten- 


tion to pattern rather than to catalogue, Schwab argued that Spencer's 

question, *What knowledge is of most worth?" long of interest to educators 
s , 

(Kazamian, 1960), may well be irrelevant. Johnson's (1962) attempt to 
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describe structure in the social sciences probably did not achieve success 
becnuse of his search for a common-denominator structure for these 
related but different disciplines. 


English 

Parker (1962) admitted that a pedagogical structure in English studies 
does not exist. He held that the subject of English should consist only of 
language, literature, and composition, The interrelationships of these ele- 
ments, while not well recognized, must receive attention if English is to 
be properly conceptualized, taught, and studied. 

In his delightful essay, MacLeish (1961) observed that more agreement 
may be obtained regarding what English is not than concerning what it is. 
His suggestion that English ministers between text and “the ground of our 
being” is poetically reassuring and a reasonable, yet incomplete, guide to 
practice. However, his affirmation helps set 
inquiry into the discipline or are 
represent a windfall. 


a proper course for serious 
a of study that is English; it does not 


Mathematics and Science 


Much attention has been focused on the national curriculum develop- 
ment projects in mathematics and the sciences. Most of these projects 
appear to have been related directly to their disciplines, However numerous 
the project descriptions that exist, misunderstanding and unintelligent 
acceptance of “new” curriculums persist. Helpful would be additional 
examinations, in depth, of the nalure of all the disciplines, such as the 
examination Rees (1962) made for mathematics. Rees asserted that mathe- 
matics is basically a language; that mathematical thinking is both inductive 
and deductive, both intuitive and analytic; that mathematical problems arise 
both from within and without mathematics: that the discoveries made by 
mathematics are useful to conceptualizing in other 
abstraction has produced simplification, (n m 
be of value in making a variety of decisions 
curriculum, 


sciences; and on 
anding these ideas should 

ie e 
about mathematics in th 


Social Sciences 


Efforts to improve social studies instruct; shi ramatic as 
those in the natural sciences, have Sae aan pid picis cem on 
analyses of the nature of the several social sciences, Important strides were 
taken by a symposium sponsored by the Am 
Societies and National Council fo 


needed, at present. are additional 
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r the Social Studies (1962). Much 


discussions about the nature of the 
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social sciences addressed more to the problem, “What is the discipline of 
history?” than to the topic, “What understandings in history should teach- 
ers have, or what understandings should be employed, in a school history 
course?” 

Progress in defining the essential characteristics of geography and 
economics for curriculum development purposes was more substantial than 
was work toward understanding the basic elements of other social sciences. 
In a publication for teachers associated with the national geography 
project, the initial chapter was titled "The Nature of Geography" (Patti- 
son, 1962). A high school American history course was suggested by Brown 
(1960), with a purpose of introducing students to history as a system of 
ideas and a system of analysis as well as to history as fact. Schrag (1961) 
reported interesting plans for a history course designed to bring students 
into closer contact with basic historical materials, methods, and demands. 
These efforts, which were related to history, like those of Davis (1962), 
were based on conceptions of the discipline that were more implied than 
elaborated. 

The observation was frequently made that sharp divisions between related 
disciplines are difficult to conceptualize. Hanna (1962) noted that structure 
is a human product to be employed as an organizational scheme, From this 
nominalist position, at such variance with the one sought by Phenix (1962), 
Hanna proposed that concepts and generalizations drawn from the several 
social science disciplines form the structural elements of the social studies 
curriculum. Hanna and Lee (1962) summarized results of a cooperative 
research project to identify and to group major social science concepts 
into such categories as “organizing and governing and “communicating 
ideas and feelings.” This significant research project may be expected to 
have an observable impact on social studies programs. However, its premise 
and conclusions may be seriously in error, as viewed from the positions 
of Phenix (1962) and Schwab (1962), inasmuch as concepts have been 
isolated from their source of origin and from the methods by which they 
were discovered and, consequently, by which they may be understood best. 
Moreover, the efforts may be seen as attempts to restructure knowledge 
from outside the disciplines. Whatever common elements may be discerned 
(Brodbeck, 1962) or whatever possible common groupings of disciplines 
may be formulated (Broudy, 1962; Kimball and McClellan, 1262) , to treat 
the “commonness” aspect as the most important one may be 5 homogenize 
the separate disciplines beyond recognition. The issue presented here is 
fundamental. Its resolution will determine, in no small measure, much of 


the character of future curriculum planning. 


Structures and Groupings of Disciplines 


and Miel (1963) made notable suggestions for dealing 
f the disciplines themselves but also with the 
nsive curriculum design. They devel- 
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Bellack (1962) 
not only with the structures 0 
structures as elements of a comprehe 
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oped the possibilities of including both discipline-centered and problem- 
centered components as advanced by Foshay (1961). — 

Partly to counteract undue fragmentation of knowledge, Bellack ( ; on 
proposed emphasis on the structures and strategies of groupings o i a 
ciplines rather than on the structures and strategies of individual discip m ii 
Additional work will be necessary, in his rather subdued nominalist sug- 
gestion, to distinguish between attention to particular systems of en 
or systems of inquiry and an orientation to a kind of reconceptualizec 
"broad fields" organization. - 

An integral feature of Bellack’s proposal was a coordinating seminat 
which would provide opportunities for students to deal directly with 
knowledge bearing on important human affairs. Such a seminar, perhaps 
similar to a successful venture described by Patterson (1962), would 
partially meet the expressed hopes of Van Til (1962) and of others who 
pressed for attention to societa] problems in the curriculum. 

À significant implication in Be 
context of studying an important 
test ideas and modes of inquiry g 
A basic issue involved here, and 


; With more accuracy and efficiency, be under- 


m patterns of ideas and inquiry methods 
specific to particular disciplines, 


how this proposal 
ivisions and indi- 
eration. Her plan 


Conclusion 


Curriculum theory building, 
edge, certainly will continue, F; 
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present results may be viewed, for purposes of research, as an inventory of 
propositions to be investigated. In addition to those problems identified in 
preceding sections, others deserve mention. One has to do with the balance 
between the necessary rhetoric of conclusions and the desired process of 
transaction in learning. Others include the relative importance of learning 
by discovery and by didactic instruction in specified types of situations; 
the types of curriculum structures, learning materials, and resources neces- 
sary for confronting the disciplines; and the procedures useful in effecting 
the institutional and attitudinal changes in prospect for the teaching profes- 
sion and the public. 

Continued efforts to describe, in meaningful terms, the nature of the 
disciplines will be of major consequence. Participation in this activity by 
the major scholarly societies and prominent scholars as well as by educa- 
tors is much to be desired. Forums must be provided for responsible 
analysis and criticism of attempts to translate theoretical formulations into 
school practice. Needed are schools and teachers open to experimentation 
employing newly designed curriculum components. In addition to examina- 
tions of particular disciplines, attention must also be directed to an analysis 
of knowledge itself. Helpful in this regard should be the analytic insights 
provided by Ryle (1949) and illustrated in several penetrating essays edited 


by Smith and Ennis (1961). 
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CHAPTER ITI 


Societal Forces Influencing Curriculum Decision 


GORDON F. VARS and WILLIAM T. LOWE 


Tins CHAPTER deals with societal forces that directly influence the selec- 
tion of subject matter or learning experiences to be provided by he 
schools. Excluded are developments within a discipline that may have 
changed the nature of content in a field, but that did not have explicit 
implications for curriculum planning. Also excluded are most studies gen 
erally dealing with social forces affecting the school as a total good 
A few particularly significant items of the latter are listed as additiona' 
references. Important findings in some studies are omitted because they 
were not specifically pertinent to curriculum decision making; the reader 
should not infer that the conclusions summarized here exhaust the signifi- 
cance of any particular study. 

Social realities such as race, social c 
are dealt with first, followed by social 
delinquency, and politics. Since Society 
lish categories; many studies deal with interrelationships among the vari 
ous aspects of society. Analyzing forces in a living, changing, incredibly 
complex society while living within it is no easier for these reviewers than 
it was for their predecessors, Breslow and others, in the June 1960 REVIEW: 
In some cases, the writers have supplied tl 
study's curriculum implications, Inevitably, 
grounds and biases of the reviewers, both in 
implications inferred. The reader is urged to 
nal and to draw his own conclusions, 


lass, peer culture, and the family 
problems such as school wap ite 
is an entity, it is difficult to esta? 


aeir own interpretations of x 
this chapter reflects the bac 
the studies selected and in the 
examine research in the org! 


Social Realities 
Racial and Ethnic Differences 


Recent research in intergroup relations appears to add little to the 8°” 
eralizations of Van Til in the October 1959 Review. One of the few stu” 
jes to draw curriculum implications from the desegregation process was 
Wiley’s (1961) study of desegregation in Evansville, Indiana. Here: i 
smooth transition was attributed largely to factors in the community, ga 
as the high respect accorded to the superintendent and members of s 
board of education. Even after desegregation had taken place, there Wê 


: ULT n the 
no planned program for discovering intergroup relations problems in t 
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schools. Moreover, only a very small percentage of the staff thought that 
human relations problems were important enough to warrant the refocus- 
ing of educational objectives or the reconsidering of school practices. In 
spite of the Evansville study. widespread repercussions of racial conflict 
argue that a school facing desegregation should develop and maintain the 
best possible human relations program. 

Attitudes in this area are dificult to change, of course. Campbell (1961) 
employed questionnaires, interviews, role playing, and a sophisticated 
social-distance scale to measure prejudice in a Southern university. Test- 
ing the hypothesis that'those who support racial segregation suffer from 
moral guilt because of the inconsistency of these tenets with the “American 
creed,” he found very little guilt—students either did not believe in the 
American creed or felt that the best way to follow the creed was to “keep 
the Negro in his place.” Teaching about democratic ideals and values will 
not, in itself, change attitudes about segregation. 

This is not to say that racial and ethnic attitudes cannot be changed. 
From his examination of existing research, Yoshino (1961) concluded 
that the teacher, curriculum, and experiences with fellow students can 
influence these feelings, and that deliberate effort at improving ethnic 
understanding does help young elementary school children. Scott’s (1961) 
study of work groups demonstrated that when management places to- 
gether adults of equal ability but of differing racial and ethnic groups to 
complete a job, human relationships improve. Key factors seem to be the 
equal status of members and the task orientation of the group, both of 
Which should be kept in mind when using the group-work approach in 


schools to reduce intergroup tensions. . 
Additional References: Hodgkinson (1962) ; Tumin (1957) ; U.S. Com- 
mission on Civil Rights (1961; 1962a, b). 


Religion 


nized religion and the public schools con- 
Political and ethnic issues frequently 


confounded debate over the place of religion in the curriculum. Bible read. 
ing, prayers, religious holiday observances, released ih sa ar ied 
of parochial school students at public expense, exemption rom s En 
Struction on religious grounds—all these were involved in case stu a 
local conflict in New Jersey, which were examined by Baker (19613. X- 
tensive interviewing and study of local press dispatches led Baker to con- 


i i r le in the degree of 
clud i hifts played an important role 1 
e that population s P ed out of the cities into the suburbs 


confli n i nd Jews mov e suburb 
and I challenged traditional Protestant doriination in 
many communities. Since this dominance was often — most ri 
matically in the public schools, it was there that the conflict occurred. 
Also important was what Baker termed “idiosyncratic individual be- 
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havior," actions of community members seemingly motivated a? = 
personal experiences and biases. There was no evidence that outsid rs 
ganizations, such as civil liberties groups, played any significant par Í 
initiating conflict. The degree of conflict was affected by the mature 9 
news coverage, actions of the superintendent, organization of protesting 
groups, and policies of the state education department. Clearly, local «a 
munity factors must be taken into account in dealing with religion in th 
curriculum. . ad 

Similar conclusions were drawn by Boles (1961), who made a carefu 
and systematic review of legislation, court decisions, and traditions affect- 
ing church relations with the public schools. He called for an objet 
study of religion as one important aspect in the development of man. He 
warned, however, that any practice that attacks or supports a particular 
sectarian point of view will cause increasing trouble as the population be- 
comes more mobile and communities become less isolated from society-at- 
large. 

Strong religious values tend to be associated with high ratings of dogma- 
tism, according to Frumkin's (1961) research with college studeni, 
Examining the relationships among dogmatism, social class, values, an 
academic achievement in sociology, he found that, regardless of other pan 
ables, the more education the student had, the less dogmatic he was. Since 
other studies have contended that dogmatism, or authoritarianism. P 
elated to social, economic, ethnic, and racial prejudice, Frumkin cor- 


: A E a à en 
cluded that instruction that reduces dogmatism in some areas tends to OP 
students! minds in others, 


If Frumkin is correct, a curriculum designed to reduce prejudice in any 
area is likely to affect students’ religious values. with possible repercussion? 
from parents and church authorities. Rosenblum's (1959) research sug” 
gests that these reactions will difer according to the degree of orthodox) 
of the religious group. He found that in religious groups classed as liberals 


the more religious the person, the less his ethnic prejudice; with orthodox 
groups, the relationship was reversed. Since public schools are committe 
to broadening students’ horizons, they must be aware of students’ prese? 
values; offer effective personal guidance for students trying to solve value 
conflicts; and keep channels of communication open among home, schoo” 
and church, so that conflicts can be mir 


nimized, 
Additional References: Corcoran (1960) : 


d 
Lenski (1961) ; Travers an 
Davis (1961); Tumin (1961). 


Socioeconomic Class 


The pervading influence of social class received additional documem 
tion by Havighurst and others (1962) in Growing Up in River CHY: i 
proximately 500 young people from age 11 to 20 were studied as they en a 
up in a typical Midwestern city with a population of 45,000. Throu’ 
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variety of measures, marked relationships were found between socioeco- 
nomic status and intelligence, adjustment, achievement, educational mo- 
tivation, educational level attained, delinquency, church activity and in- 
terest, age of first marriage, work adjustment. and initial adult adjustment. 
Many of these factors interacted. Considerable social mobility was noted, 
with educational achievement the usual way to rise in status. The school 
was urged to make up for the disadvantages with which some children start 
out in life and to provide a climate which minimizes conflicts between 
socioeconomic classes. 

Comparing mill (lower class) and town (higher class) sections of a 
Southern community, Morland (1960) found significantly lower educa- 
tional aspirations among lower class children, but no significantly different 
occupational aspirations. This study may suggest that guidance and in- 
struction should stress the relationship between education and career 
advancement. 

Rutledge (1960) also called for improved guidance and for 
of the curriculum to meet the needs of different status groups. He found a 
number of significant relationships between social status and the personal- 
Social problems of young adolescents revealed by questionnaires and by 
the Mooney Problem Check List. Though sex was not a significant factor 
in the number of problems checked, social status apparently had more 
impact on the girls than on the boys. As might be expected, the incidence 
of problems was highest for students of low social status, lowest for those 


of high status. When comparing achievement of Spanish American and 


Anglo-American children, Cline (1961) found that biculturalism by itself 
ic class does. 


does not affect achievement, but that socioeconom! . 

Either explicitly or implicitly, these studies secm to suggest special 
Curriculum provisions for children from the lower socioeconomic levels. 
This is truly an explosive issue if it is taken to mean conscious segregation 
of children by socioeconomic class. Conventional ability grouping is at- 
tacked even now on the grounds that it segregates social classes. Hence, it 
is difficult to see how such a proposal could win widespread acceptance. On 


the other hand, guidance-centered programs carried out in heterogeneous 
Ara al attention without labeling students. 


groups might provide needed speci | 1 
Additional fumme Dawson (1961) ; Fisher (1961) ; King (1961) ; 
Robins, Gyman, and O'Neal (1962). 


adjustment 


Peer Culture 


rtance of the adolescent subculture was further 
documented by Coleman (1961) in his analysis of the social systems of 
10 high schools in widely varied communities. Stressing differences be- 
tween adolescent values and those of adult society. he contrasted the pas- 
Sive dependency which the school imposes upon adolescents with the social 
emancipation granted them by their parents. Coleman asserted that intel- 
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lectual activities must capture the excitement of doing. a ord 
ing, and meeting a challenge, if they are to compete with ath etic, Scat 
and extracurricular activities for students’ energy. Even more be É 
were the implications for the cocurriculum, since interscholastic ath amr 
was seen as a major contributor to the status achieved by high sc ote 
boys, and popularity or social success, the major status factors for gir Sd 
To channel adolescent energy into academic pursuits, Coleman proposes 


Mn 4 ; : i : nes 
interschool competition in scholastic matters, such as intellectual gam 
and problems. 


Additional Reference: Smith (1962). 
Family 


As the family changes, so must those aspects of the curriculum devoted 
to family life education. Studies such as those by Roy (1961) and Wei 
(1961) indicated that employment of both parents is not necessarily harm- 
ful to their children in terms of academic achievement, social life, ie 
proneness to delinquency. These studies suggested, among other things: tha 
both course content and guidance progr 
the circumstances under which a carcer 
successfully. 


Havighurst and others (1962) found many relationships between early 
marriage, low intelligence, low social class status, and school dropouts: 
Furthermore, nearly all the young couples studied rated low in marit? 
adjustment. Nevertheless, they concluded that e 
most desirable course for girls who w 
ommended a special curriculum for 
as wives and mothers-to-he, 
a combination social worker 
Activities for the older girls, 


Tu 
had already left school, would be designed to give group members a soure 
of needed emotional support as they made the transition to married life- 


; ier 

ams should help girls to mundi 
; . a 

and homemaking may be combit 


4 : he 
arly marriage might be = 
ere unsuccessful in school: they ! 


jages 
ev eg om his survey of student marti pr 
in Missouri high schools, he concluded that one of the most effective dete 


operating in cooperation with all youth-serving agencies in the community: 

Additional References: Brim, Fairchild, and Borgatta. (1961) ; Dentler 
and Hutchinson (1961) ; Dentler and Monroe (1961) ; Riessman (1962) ; 
Westoff and others (1961). 


Social Problems 


, - ; —— 10 
The social realities described earlier interact in numerous le A 
produce pressing social problems, particularly in the nation's large C! 
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Dropouts 


Though not a research report in the usual sense, Conant’s (1961) Slums 
and Suburbs called the nation’s attention to the “social dynamite”? accumu- 
lating in the big cities as the result of dropouts and unemployment and 
complicated by social class and intercultural problems. Contrasting educa- 
tional opportunities in,slums and suburbs, he called for improved guidance 
services and for realistic vocational preparation, including work-study 
programs, for city youth. Burchill (1962) cited numerous detailed case 
studies to support the value of work-study programs for disadvantaged or 
"alienated" youth. 

New York's School to Employment Program (STEP) and Chicago's 
Educational and Employment (Double EE) experiment exemplify existing 
special curriculum arrangements for dropouts and potential dropouts. 
Unfortunately, in a study completed in 1961. Rioux (1961) found that 
less than one-fifth of the 100 largest cities had any special provisions for 
all proportion of the others had plans to 
develop such programs. Most special provisions were primarily vocational 
or prevocational and directed toward preparing the student for employment 
in a short space of time. Havighurst and others (1962) pointed out that 
a community-wide effort is required to solve the dropout problem. Besides 
work experience for boys, they advocated the preparation for marriage 
described earlier in this chapter. The study by Lichter and others (1962) 
of intellectually able potential dropouts showed that no one plan works best 
With all youngsters. 

Like Havighurst and his associates, Lichter and thers pom that 
adequale success experience in school was a key factor in a Pd. ents 
continuation. Johnson (1960). too. noled that two-thirds of the dropouts 
he interviewed had been school failures. In spite of this, the dropouts as 


a group did not dislike either school or the teachers and probably would 
hav «chool if they had been encouraged. 
vh aene pecca n Williams (1963) also indicated that 


The Maryland survey reported by g 
most dropouts were fairly normal youths, 49.8 percent being average or 
. both parents, and 79 


ween ee up 
above average in intelligence, 70 percent living with eS ds 
Percent not considered serious behavior problems by either their counselors 


or their parents. Although he concluded that major influences causing 
carly withdrawal came from outside the school, Johnson (1960) called for 
à number of curriculum revisions lo reduce dropouts. He recommended 
that participation in extraclass activities be encouraged and guidance serv- 
ices be extended, with special individualized altention to those with low 
grades and irregular attendance. The entire staff, he concluded, should 
concentrate on motivating students igh meaningful course content and 


through interesting instruction. 

More deos ebria modifications were advocated by Chandler, 
Stiles, and Kitsuse (1962) in an interdisciplinary symposium. Besides 
Spending more time in the study of social problems, schools should aid 
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desirable social changes like urban renewal by deliberately attempting w 
re-educate relocated slum dwellers. Advocating drastic overhaul chad 
politan school organization and administration, Havighurst a S Jon 
ommended that special attention be given to the culturally deprived at oe 
elementary level and at work-study centers for children who cannot pos 
from regular schools. New York City's Higher Horizons project has 
demonstrated the value of cultural enrichment for disadvantaged children. 
Tt is not just the large cities that are in difficulty because of demographic 
changes. In Plainville Fifteen Years Later, Gallaher (1961) documented the 
woes of a small rural town: economic decline. paucity of cultural ad- 
vantages, rigid class system, and exodus of capable vouth. Curriculum 
implications that are left to the reader would likely include attention to 
community problems in the school. an improved school cultural program, 
and vocational preparation geared to a mobile student population. — 
Additional References: National Education Association and American 
Association of School Administrators, Educational Policies Commission 


(1962); U.S. Department of Health, Education, and Welfare. Office of 
Education (1962). 


Delinquency 


Delinquency, too, seems to be rel 
and others (1962) found delinquen 
centrated among those of lower soci 
of school, had relatively low inte 
adequate homes. They postul 
quency because they are un 


ated to success in school, Havighurst 
cy among River City youth to be co” 
oeconomic class who had dropped out 
lligence, and came from broken or ne 
ated that at least some students turn to delin- 
able to achieve success according to the usua 
standards prevailing in the community. Elliotts (1962) questionnaire 
study tended to confirm this point of view. He found that both delinquent 
and nondelinquents held similar views of the role played by a given occupa 
tion or level of education in achieving success, but delinquents, regardles® 
of their social class, perceived themselves as having had less opportunity 
for achieving success, ù 

Karacki and Toby (1962) investigated a particular juvenile gang i? 
test the commonly held assumption that this form of delinquent behavior 
stems from economic and social deprivation, On the contrary, members 
of the gang were all white and nearly all of the same religious gtoUP: 
middle class, and from privately owned homes. The authors conclude 
that failure to develop commitment to adult was 
the basic source of gang cohesiveness, 
of the peer culture, such as demand for immediate gratification of therr 
desires, loyalty to peers. and assertion 
aggression. Karacki and Toby 
even in blight areas, 


260 


roles and adult values lucs 
S >» valuc: 
Gang members held to the valu" 


P ;sica 
of masculinity through phys i 
concluded that delinquency could be reduc ia 
z Ren H ra er socle 
hy developing, through schools and other $í 
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institutions, opportunities for the adolescent to develop allegiance to the 


values prevalent in adult culture. 
Additional References: Kinch (1962) ; Martin and others (1961). 


Political Problems 


Though cold war tensions must inevitably influence curriculum deci- 
sions, there appears to be little empirical esearch to document this fact. 
Feldman (1960) queried city school administrators about instruction con- 
cerning the United Nations Educational, Scientific, and Cultural Organi- 
zation. He found little instructional use of UNESCO materials and con- 
cluded that controversy over UNESCO in the Los Angeles area might have 
further reduced the study of the agency in that section of California. 

Local, state, and national politics also influence curriculum decisions. 
After studying political forces affecting school finances in eight North- 
eastern states, Bailey and others (1962) concluded that it is essential for 
educators to become active in polities. Robinson (1961) came to a similar 
conclusion from his review of selected legislation on school matters. He 
found rigid, conflicting legislation in many regions of the nation and 
concluded that one of the best ways to avoid curriculum rigidity is to 


provide federal funds to the states for broad educational purposes. 
and freedom to decide how the funds 


Each state would have wide discretion 


would be allocated to local districts. i 
Both Jordan (1961) and Samalonis (1962) studied the influence of 


State legislatures upon curriculum. Jordan concluded that Indiana statutes 
contain an excessive number of mandatory requirements which tend to 
make the Department of Public Instruction a regulatory agency rather 
than a source of leadership. After examining existing laws in all 50 states, 
Samalonis stated that the most common practice of legislatures was to 
make subject matter prescriptions without explicitly stating objectives or 
clarifying the intent of the statute. Physical fitness and the social sciences 
were the most highly regulated subject matter areas, in terms of both 
lime allotment and grade placement. : 
Pressure groups also make their influences felt upon the curriculum. 
l'arnand (1959) studied the social philosophies of three prominent groups 
Opposed to federal aid to education: the American Farm Bureau F edera- 
tion, the Chamber of Commerce of the United States, and the National 
Association of Manufacturers. He characterized all three as conservative, 
materialistic, and pragmatic. Decentralized government was favored by 
these groups, who believe that government should be primarily political, 


not economic, but that it should protect business. Since these organizations 
furnish th- schools with a great deal of free and inexpensive instructional 


are likely to be reflected in the curriculum. 


material, their points of view . Lac 
biased materials be eliminated, but that 


his is not to suggest that 
leachers must use them with discretion and must be aware of the points of 
view represented. 
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Selakovich (1962) studied techniques used by pressure groups Rp 
ing to influence the teaching of American history; namely. the Ameritan 
Legion, the Daughters of the American Revolution. the Conference o! 
American Small Business Organizations, the Friends of the Publie Schools. 
and the National Economic Council. These groups held similar concepts 
of Americanism, favored a strict interpretation of the U.S. Constitution, 
and endorsed positions generally hostile toward the United Nations. 
Charges of disloyalty or of subve 


ion were leveled by these groups against 
both teachers and the instructional materials used in schools. Teachers also 
were accused of being ignorant of the Constitution ànd of failing to teach 
Americanism. On the other hand, Kitaguchi (1962) found that 1n 
Hawaii, at least, the points of view expressed by organized labor groups 
did not differ significantly from those of a panel of educators. 

Although numerous groups seem ready to pounce upon the schools 
if they deal with controversial issues, some studies indicate that there 
may be less community opposition than is commonly assumed. Such was 
the conclusion of Coan's (1961) study of parents and teachers in. Kansas 
Rosenthal (1961) found in his study of two communities in New Jersey 
that educators generally had a rather wide scope for independent action. 
This was particularly true on narrow audience issues such as curriculum 
and personnel, as opposed to wide audience issues having to do with 
budgets and bond issues. A complicating element mentioned by Cooper 
(1959) was the fact that teachers, principals, and superintendents interpret 
external pressures differently. Sometimes, they do not even recognize the 
forces being exerted upon their curriculum and 

Additional References: Ayars (1961); Martin (1962) : Munger and 
Fenno (1962); Newman (1961); Palmer (1961) ; Rossi and Dentler 
(1961); Schmidt (1962) ; Todd (1962). 


teaching. 


Economic Influences 


Continued prosperity and booming technolo 


nomic situation in which curriculum decisions are being made today: 
Neither of these appears to have been 


: arg to d extensiyely researched. Rhoades 
(1961) found wide Variations in the degree of optimism reflected in publi- 
cations dealing with the implications of automation for education. Mor? 
advanced technical training seemed to be indicated, although automation 
might require less training in specific skills and more attentidri to bas!c 
sciences and general education, Students should be prepared for change: 
and their education should be oriented more to the future than it i$ 2 
present. Two likely outgrow 


ths of automation—technological unemploy” 
ment and a shorter workweek—highlight the need for education in the Wis? 


use of leisure time. Ford's (1962) analysis of leisure time activities of a&° 


residents of Indiana indicated that schools of the past could take liile 
credit for interesting these individuals in satisfying recreational activities. 
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iu E s E ma tr m xd Eee 
Lyle, and Parker (1961) studied tele isi me npe di " ee MD 
je wanker: d studied television s impact upon children in a 
" ge study involving 6.000 children and 2,300 teachers and parents. 
Ere dar ades that, for most children under most conditions, television 
"i : 1er paitieularty beneficial nor very harmful. One fact of impor- 
x ce to curriculum planners is that children with average intelligence 
mu have wider vocabularies if they are frequent viewers. Previous 
un a ae eee that television may be sending children to school 
es epis but superficial acquaintance with much of the world that 
inte too young to understand. Further research on this question, par- 
P. vx y at the local level, should enable schools to adapt the curriculum 
actual sophistication of the student. population. 
" OR deeper significance, perhaps, is the image of American character 
portrayed by this influential medium. After a years study of network 
television programs, Bell (1961) characterized this image as that of a 
oe class, middle-aged. middle-of-the-road male—a nonheroic figure 
per pga os society. His interest is in immediate, personal problems, 
val A philosophy or in world problems, and he is governed by relative 
alues. Schools must deal with this image as they attempt to modify student 


M : : a 
alues through curricular experiences. 


Conclusions 


al forces affecting curriculum deci- 
ative conclusions: 

es of societal forces upon deci- 
ted by the research. reviewed. 


, Examination of research on societ 
sions leads the writers to the following tent 
"d The basic assumption of the influence 

l concerning the curriculum is suppor 

hat is taught, why it is taught, and how it is taught are affected by 

ynamic social forces. Study of these forces is a continuing responsibility 
of all educators. ! 

2. There is a growing body of 


ate number of relevant doctora : 
ature of the topic, this chapter has emphasized applied research. 
3. There still is not enough research in this field. Significant questions 


remain almost totally unstudied, especially the curricular implications of 


S H 
9me of the basic social problems and changes. 
rt or replicate older research 


$ 4. A number of studies simply suppo 
‘A sag that schools all too frequently do not apply research findings 
a P 

ready available. s 

5. Numerous studies emphasize the importance of providing curricular 
Cxperiences that are meaningful to the learner in terms of the cultural 
Ontext ; K ee 

iol in which he is living- 
ge he additional references liste 

neral overviews of the area than do mos 


Consj i i 
nsidered in this chapter. 


basic and applied scholarship in this area. 
| studies is encouraging. Because of the 


d below represent more comprehensive and 
t of the sources that have been 
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2); Bi stok Sayres (1902) : 
itional References: Bell (1962): Bienenstok and Sayres ( 
e eee; e Wronski, and Hanson (1962); Halsey, Floud. and 
Anderson (1961); Ladd and Sayres (1962); National Society for the 
Study of Education (1961). 
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CHAPTER IV 


Psychological Forces Influencing Curriculum 
Decisions 


NORMAN D. BOWERS 


Waar ARE the psychological forces that influence curriculum decisions? 
The immediate and traditional response of many educators includes ne 
areas related to the pupil: principles of learning, characteristics of pi 
growth and development, and personality factors. The inference custommatiy 
made is that possession of accurate information in each area would aic 
one in making valid decisions about the curric S 

However, other answers to this question are possible; and an alternative, 
although seemingly more complicated, was selected, In addition to the 
characteristics of learners and the learning process, psychological factors 
that influence curriculum decisions were considered to include the pe 
tional variables that are manipulated in the school environment to promote 
learning as well as the skills and the abilities that accrue to the learner a5 
a result of education. 

This chapter will be concerned with recent rese 
tion about four areas: the chara 
ing, the effects of different tasks 
skills and abilities that can be yi 
intended that this chapter not 
existing and extensive revie 


ulum. 


arch that gives informa 
cteristics of learners, the process of learn- 
and methods, and the definition of various 
ewed as outcomes of school learning. It 5 
only will direct attention to the already 
ws of psychological research but also T" 

evaluate some current research that appears promising. Moreover, it Nu 
suggest areas in which more research or a different approach appear? 
appropriate. The distinction between social and psychological forces 17 
arbitrary at times. Chapter III of this REVIEW is concerned with many ° 
the forces that could be classified as psychological within the framework ? 
an alternative organization, 


The Learner 


Extensive and numerous studies 


" :or charac 
FR ; about various learner behavior chara 
teristics exist, Among these, a con 


. i i» 
siderable synthesis of information “the 
apparent in the Sixty-Second Yeurbook of the National Society for 


Study of Education (1963), which dealt with child psychology- Tus 
lengthy bibliographies, which were available in the Annual Review of PSY 
chology, edited by Farnsworth (1961.63), represented a valuable 2? 
useful source of research studies in developmental psychology. Specialize 
reviews concerned with the assessment of anxiety (Martin, 1961) a 
consideration of the self-concept (Lowe, 1961) offered not only a collec? 
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romae a e a e eee 
s Wo s ariz S e exceptional 
child. The availability of these publications should be helpful in the deriva- 
tion of accurate summary statements that relate to learner characteristics. 

In addition to these reviews, three different kinds of publications were 
of special interest relative to curriculum decisions. The first of these, 
Growing Up in River City (Havighurst and others, 1962), represented a 
study of certain features of the maturation process in a group of 487 
individuals over a period of nine years. Concerned primarily with social 
institutions and their impact on adolescent behavior, the psychological 
factors that were studied included talent, personal and social adjustment, 
and school progress and achievement. Although relatively simple analyses 
of the data were given, a final report undoubtedly will offer additional 
insights that can be derived from this study. 

The 1962 Yearbook of the Association for Supervision and Curriculum 
Development (1962) purported to describe how schools may help to 
develop sufficient, adequately functioning, and self-actualized persons. 


Containing much of interest and many statements on which there could 
rived initially from prepared 


ast within the framework of 
little new 


be almost universal agreement, material de 
Statements of well-known authorities was € 
phenomenology. Although many interesting insights were given, 
quantitative evidence was presented. : . : 

The third, somewhat longer publication was the Sixty-First Yearbook 
of the National Society for the Study of Education (1962), which dealt 
With the perennial topic of individual differences. Various topics were 
Summarized and discussed, and helpful bibliographies of research were 
given, This important area of individualizing instruction certainly remains 
One of the primary psychological forces that affect curriculum decisions. 


The Learning Process 


Of the numerous learning investigations completed during any given 
lime, few seem to have direct relevance to the school curriculum area. 


fter a review of the studies related to learning that were reported in the 
Journal of Experimental Psychology, it is hoped that more symposiums, 
Such as the one edited by Carton and Hively (1959) in which Spence 
(1959). Melton (1959), and Underwood (1959) participated, will be held. 
n this symposium, application of laws, or principles, of learning to the 
Classroom and to the school situation were considered. In the papers pre- 
sented, it was proposed that a mutually supporting relationship be built 
between the science of learning and the technology of educational method. 
An illustration was given of the relevancy of a laboratory psychological 
Variable to a school situation. " . 

Two recent longer books are noteworthy in summarizing material and 
Presenting different insights about. human learning. learning theory. and 
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teaching. Edited by Glaser (1962), the first book, which distinguished 
between training and education, reported the results of numerous military 
research programs involving the learning of complex skills. Although the 
point of view was somewhat different from that usually encountered in 
many curriculum publications, such as those prepared by the Association 
for Supervision and Curriculum Development (1959, 1960), curriculum 
workers not only should take heart, but also take heed of the suggestions 
made by interested and obviously competent researchers. The second 
publication offered a survey and interpretation of current. psychological 
theory (Hill, 1963). Together with a clearly written contrast of learning 
in the school situation and of learning in the laboratory, a most informa- 
tive essay on the present-day value of learning theory was included. 

An interesting exchange of considered opinions that were related to 
verbal learning took place between Arnstine (1962) and Ausubel (1962). 
Such controversies, which are rarely published in the educational literature. 
should happen more frequently because clarification of one’s ideas about 
the nature of the learning process is afforded. 

Representative studies about learning with implications for curriculum 
decisions included those of Klausmeier and Check (1962). Lockard and 
Sidowski (1961), and Bilodeau, Sulzer, and Levy (1962). Continued study 
of the learning process in the laboratory and in applied situations is certain 
to continue. For educators, there exists a real need for the research to be 
forged into some kind of meaningful scheme. Clarification of curriculum 
theory might serve to simplify the task of selecting and analyzing important 
and relevant atomistic psychological studies related to learning. Work 


such as that of Beauchamp (1961) would provide a starting point for 
such an analysis. 


The School Learning Situation 


Although psychological factors sometimes are confined to the learner 
and to the process of learning, other important variables of a psychological 
nature exist within the situation of the school. The teacher and his constel- 
lation of traits represent one dimension of such variation. The basis for 
selecting among many alternatives that are related to tasks and methods 
suggests a second domain pertinent to psychological factors. The entire 
area of programed instruction and teaching machines raises questions 
about tasks and methods. Group relationships form another important 
variable within the situation. Obviously, the writer cannot do more than 
point out the general availability of research in these different areas. 


The Teacher 


The long-awaited Handbook on Research in Teaching (Gage, 1963) 
contains many references to the recent research literature concerned wit 
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the teach is ivities i 
Mi aen «i and his role and activities in the classroom. This volume, which 
tedir- i 2 : 
edly will become a standard reference for research workers inter- 


) 
ested i i i i 
zin in teaching and school learning. gives the most extensive and com 
es ary reaeardec "TT i 4 
ummary of research currently available on teachers and teaching 
f= 
] 


Task and Method Variables 


mber 1961 Review on growth, develop- 


Since the publication of the Dece 
ppeared that offers conflicting or 


ne and learning, relatively little has a 
difere a m : d 5 A ME 
erent results concerning task and method variables. Two investigations 


(19 dm of the kind of research in this area. Ausubel and Fitzgerald 
62) reported a study of the effects of advance organizers, antecedent 


learni : ; A 
arnings, and general background knowledge in learning and retention. 
for individuals with low verbal ability, and 


; seemed to facilitate learning. In 
a function of interpolated activ- 
ag other things, that rest 


Organizers enhanced learning 
i» general background knowledge also 
PCM. of performance after failure as 

y, Smoot and Lawson (1962) concluded, amor 


a eee " à z 
a pa failure most effectively altered the influence of failure. 
wo investigations scemed to emphasize the importance of introducing 


appropriate task and method variables in the preparation of teachers. 
Wittrock (1962) concluded that set operated on the behaviors of teacher 
candidates. Davis and Bowers (1961) reported that data that indicated 
training of future teachers in terms of specific kinds of learnings appeared 


to have positive effects. If task and method variables were inoperative or 
v portat to either of these studies, it would appear that the results 
disce! 
escribed could not have be 


en demonstrated. 


Automated. Instruction and Teaching Machines 


Numerous questions rel ological forces and curricular 
decisions have been raised by the recent upswing of interest in automated 
Mstruction, self-teaching devices, and programed materials. In the April 
1962 Review, Silberman concluded that a greater burden of responsibility 
for learning would be placed on the quality of the instructional materials. 
Other recent reviews, including one by Morrill (1961), listed different areas 

Some of these problems have, of course, 


| r 
Sentence for a research program. 
een dealt with in research studies. An entire issue of the Journal of Edu- 
cational Research, edited by Lambert (1962), was devoted to describing a 


i number of specific studies on one or more of these topics. 
sides publication from the military setting reported a conference dealing 
oor programed learning jointly sponsored by the Office of Naval Research 
the System Development Corporation (Coulson, 1962). This volume, 


whi : 
hich offered additional reviews of research as well as papers related to 
and to the factors important to computer- 


a description of a computer-based labora- 
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tory for research and development in education. The summary of impli- 
cations of current research and future trends by Stolurow (1962) 
constituted an interesting and hopeful picture of developments in this area. 
However, one also should read the paper by Pressey (1963), who pointed 
out that current programmers in blindly pursuing their art have been mis- 
led by a theory derived from comparative studies. He contended further 
that write-in responses were not as valuable as objective items; that rein- 
forcement was not the significant process: and that programming was 
destructive of properties underlying a cognitive structure. 


Group Interaction 


Consideration of the school learning situation would be incomplete with- 
out some mention of the group forces operant within the school situation. 
A series of readings has been published by the National Training Labora- 
tories (Bradford, 1961) as Muman Forces in Teaching and Learning. This 
publication brought together a group of thoughtful and useful papers in 
one volume. 

Important reviews of research studies on group interaction have been 
written by Mann (1959) and by Glanzer and Glaser (1959, 1961). The 
Fifty-Ninth Yearbook of the National Society for the Study of Education 
(1960) also constituted a helpful reference source of material upon group 


processes. Many individual studies continue to be reported in the periodic 
literature. It seems that the substantial inform 


l ation already available will 
make possible the effecting of some valid curriculum decisions. 


Learning Outcomes 


Given information about the pupil, the learning process, and the com- 
plexity of situational variables, the generalized outcome of education 
appears to be the development of the intellective process, including the 
so-called higher mental processes such as thinking, problem solving, and 
creative thinking that can be described in a theoretical framework such as 
that posed by Guilford (1959). Other neglected but important outcomes 
include such skill areas as reading, listening, and oral and written expres- 
sion. 

In their investigation of divergent thinking abilities and teacher ratings, 
Klausmeier, Harris, and Ethnathios (1962) found that the nature of the 
divergent thinking factors and the expression of them in classroom activi- 
ties varied markedly according to sex and subject matter field. In an 
extensive study, Merrifield and others (1962) reported that no evidence 
of a unitary problem-solving ability was located—a finding which seemed to 
suggest that problem solving can be conceptualized in terms of factors in 
the structure of intellect. A noteworthy review of recent research on problem 
solving was supplied by Duncan (1959), who concluded, among other 
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things : z n . A 
ings, that the field of problem solving was poorly integrated. Perhaps, 
as integration is achieved, an increasing number of teachers and curriculum 
workers will become aware of the challenges and the issues posed by the 
exhibit regarding the nature of human thinking 


s 


concern that psychologis 
and problem solving. 

In reporting upon the results of three. experiments dealing with the 
icksen (1962) suggested the importance of generaliz- 
sss beyond the laboratory. It was contended that 
accrue from improved abstract 


abstraction process, F 
ing the abstraction. proce 
significant. educational benefits would 
learning abilities. | 

Much rescarch on creative thinking has been completed, and a consider- 
able portion has been included in two books by Torrance (1962) and by 
Getzels and Jackson (1962). Taylor and Holland, in the February 1962 
Review, furnished a summary of studies relating to measurement of 
creativity, 

In such outcome areas as reading, listening, and oral and written expres- 


sion, some reviews of recent research exist, but relatively few individual 
rized the investigations related 


studies are available. Harris (1963) summ: 
to reading, and Strom (1963) surveyed investigations in the language arts 
area. Petty (1962) gave some research direction to the study of listening 
skill: Brown (1962) developed an index of listening; and Andrews (1962) 
described a comparative study of methods of developing listening ability. 
aoe skill areas received little or no research attention in terms of well. 
designed, quantitatively oriented studies. . 
In an iE effort, Kelley and others (1962) studied the development 
of cooperation in a minimal social situation. Although they discussed some 
esearch, the authors did not place their findings 
hool situation. Such an atlempt to interpret 


o educators. 


Possible implications of ther 
Within the context of the sc 


their find: : 
ndings might be useful t : 
It is obvious that, although there is a great deal of research available 


With respect to many lines of inquiry, many areas that have been neglected 
need attention and clarification. Little research related to learning outcomes 
‘as dealt with the comparative effect of task and method variables on 
Carning. It is in this area, probably, that research workers can make 


as 1 H " 
ting contributions. 


Research Designs and Psychological Influences 

ter reading numerous individual research studies 
ask of integrating information is complex 
tinued development in research-design 
s Iciency seems to be dictated. Many studies showed a certain naiveté about 
research design and analysis. Fortunately, a number of recent publications 
maling with this important topic should help to develop future sophistica- 
i n. Campbell and Stanley (1963) wrote a lengthy and provocative chapter 
n the Handbook on Research in Teaching that was concerned with experi- 
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mental and quasi-experimental designs. This chapter should prove to have 
a great effect on the quality of future research, as should the chapter by 
Medlev and Mitzel (1963), who presented, among other things, a lucid 
treatment of analysis of variance involving complex components. 

Other complex techniques for analysis of data seem to hold particular 
promise for work in the curriculum area. Chief among these are the multi- 
variate methods that simultaneously consider a number of measurements on 
the same person. These procedures appear to hold thé key to an accurate 
analysis of the range of abilities as well as of the skills or learning out- 
comes encountered in school situations. An introduction to multivariate 
techniques may be found in a book by Kendall (1957). Some explanation 
and helpful listings of FORTRAN-coded computer programs were given in 
a recent book on multivariate procedures by Cooley and Lohnes (1962). 

A book by Winer (1962) containing many different designs should find 
wide usage among curriculum research workers. Particularly helpful was 
the detailed explanation of the meanings of interaction effects; the discus- 
sions related to the use of orthogonal components in tests for trend; and a 
series of illustrations and worked examples of nested factors, or hierarchal 
designs. This latter design, which is an often neglected analysis, is poten- 
tially useful because it can show the significance of differences for groups 
within methods, schools within cities, classes within schools, and the like. 

The papers by Kempthorne (1961) and Stanley (1961), which appeared 
in the Phi Delta Kappa symposium on research, should be particularly 
helpful in planning and executing research involving psychological factors. 


Conclusions 


Although the research in the area of psychological factors appeared to 
be extensive and the reviews of the research were numerous, much of the 
available information did not seem to have direct usefulness and applica- 
bility in the making of curriculum decisions. Some direct attack on this 
problem is needed. In addition, at least two questions should be raised 
regarding the currently available data: 


1. How does knowledge about psychological characteristics of learners 
become translated into patterns of action related to the curriculum? 


2. In what ways would any given learning theories result in curriculum 
differences, including a differential in learning outcomes? 


Although the newer statistical techniques may furnish valuable insights 
relative to the interpretation of complex data, development of a theory of 
curriculum is an important and needed concomitant to the realization of à 
coherent understanding of the implications to classroom learning that are 
present in the extensive psychological literature. The upswing in interest 
in educational problems, which has been evidenced by research workers in 
certain areas of psychology, should make the next few years productive in 
delineating psychological factors appropriate for curriculum decisions. 


274 


o iC s Voy + 
June 1963. Psycuotocicar Forces INFLUENCING CURRICULUM DECISIONS 


Bibliography 


Axprews, Donoruv E. "Comparative Study of Two Methods of Developing Musi 
Listening Ability in Elena ary School Children.” Journal of essere im [A 
q Education 10: 59-64; Spring 19 
bi a, Donan C. “Assumed, oals for Problem-Solving and Education: Chinks 
ree € Defense of Verbal Learning. Educational Theory 12: 226-29; October 1962. 
Association For SUPERVISION AND CURRICULUM DEVELOPMENT. Learning More 
About Learning. (Edied by Al «ander Frazier.) Papers and reports from the 
dined ASCD Research Institute. Washington, 9.C.: the Association, a department 
A b the National Education Association, 1959. 88 pp. , 
CONDON FOR SUPERVISION AND CurricutuM DEVELOPMENT. Freeing Capacity 
AS Learn, (Edited by Alexander Frazier.) Papers and reports from the Fourth 
SCD Research Institute. Washington, D.C.: the Association, a department of 
the National Education Association, 1960. 97 pp. 
SSOCIATION FOR SUPERVISION AND CURRICULUM DEVELOPMENT. Perceiving, Behaving, 
^ ming: A New Focus for Education. 1962 Yearbook. Washington, D.C.: the 
re ociation, a department of the National Education Association, 1962. 256 pp. — .. 
UsunEL, Davin P. “In Defense of Verbal Learning: Beams in the Eyes of a Critic. 
Educational Theory 12: 230-33; October 1962. 
USUBEL, Davin P., and FITZGERALD, Doxarp. “Organizer, General Background, and 
Antecedent Learning Variables in Sequential Verbal Learning." Journal of Educa- 
tional Psychology 53: 243-49; December 1962. 
EAUCHAMP, Georce A. Curriculum Theory. Wilmette, Il.: Kagg Press, 1961. 149 pp. 
ILODEAU, Epwarp A.; SULZER, JEFFERSON L.; and Levy, C. MICHAEL. Theory and 
Data on the Interrelationships of Three Factors of Memory.” Psychological Mono- 
graphs: Vol. 76, No. 20 (Whole No. 539), 1962. 22 pp. : 
RADFORD, LELAND P., editor. Human Forces in Teaching and Learning. Selected Read- 
ing Series Three. Washington, D.C.: National Training Laboratories, National 
ducation Association, 1961. 102 pp. —— 
TON. CianLes T. "Introductory Study of Breathing as an Index of Listening. 
peech Monographs 29: 79-83; June 1962. i i ; 
AMPBELL, Dowatp T, and STANLEY, JULIAN C. "Experimental and Quasi-Experi- 
mental Designs for Research on. Teaching." Handbook oj Research, on Teaching. 
(Edited by N. L. Gage.) A project of the American Educational Research Asso- 
ciation, a department of the National Education Association. Chicago: Rand 
McNally & Co., 1963. Chapter 5, PP- 171-246. " " m 
ARTON, Aaron S. and HivELY, WELLS, II, editors. “A Symposium: ‘Can the Laws 
o Learning Be "Applied in the Classroom?’ " Harvard Educational Review 29: 
-117; Spring 1959 
OOLEy, NE weal Lounes, P. R. Multivariate Procedures for the Behavioral 
Sciences, New York: John Wiley & Sons, 1962. 211 pp. " 
OULsoN, Jonn E. Programmed Learning and Computer-Based Instruction. Proceed- 
ings of the Conference on Application of Digital Computers to Automated Instruc- 
p lion, October 1961. New York: John Wiley & Sons, 1962. 291 pp... 
Avis, O, L., Jr., and Bowers, NORMAN D. “Group Sensitivity Training in a Teacher 
a ücation Program: An [Initial Attempt.” Peabody Journal of Education 39: 
"T4: September , TC n 
bow CM a a Research on Human Problem Solving." Psychological 
> Bulletin 56: 397-429; N ber 1959. i r 
RICKSEN, alley Ae dies in the Abstraction Process.” Psychological Mono- 
pE'tblis! Vol. 76, No. 18 (Whole No. 537), 1962. 28 pp. 
ARNSWORTH, PauL R., editor. Annual Review of Psychology. Palo Alto, Calif.: 
c, nmnual Reviews, Vol. 12, 1961, 533 pp. Voi. 13, 1962, 602 pp. Vol. 14, 1963, 454 pp. 
ACE, N. L., editor. Handbook of Research on Teaching. A project of the American 
Educationa: Research Association, à department of the National Education Associa- 
Gri. Chicago: Rand McNally & Co, 1963. 1218 pp... " 
ZELS, Jacon W., and JACKSON, Pamp W. Creativity and Intelligence. New 
ork: John Wiley & Sons, 1962. 293 pP- : 
h ZER, Murray, and GLASER, ROBERT- “Techniques for the Study of Group 
ructure and Behavior: 1. Analysis of Structure.” Psychological Bulletin 56: 


32; September 1959. 


Gra 


215 


Review or EDUCATIONAL RESEARCH Vol. XXXIII, No. 3 


Y Turray, and Graser, Rosert. "Techniques for the Study of Group 
Cy ed Behavior: II. Empirical Studies of the Effects of Structure in Small 
Groups.” Psychological Bulletin 58: 1-27; January 1961. NS š 
Graser, RoseRrT, editor. Training Research and Education. Pittsburgh: University 
of Pittsburgh Press, 1962. 596 pp. zi 
Gumrorp, J. P. “Three Faces of Intellect.” American Psychologist 14: 469-79; 

August 1959. 

Harris, Tigopong L. “Summary of Investigations Relating to Reading, July 1, 1961, 
to June 30, 1962.” Journal of Educational Research 56: 283-301; February 1963. 

Havicnurst, Rozert J., and oruers. Growing Up in River City. New York: John 
Wiley & Sons, 1962. 189 pp. " 

Mitt, Winirren F. Learning: A Survey of Psychological Interpretations. San Fran 
cisco: Chandler Publishing Co., 1963. 227 pp. 

Kewtey, Hanorp H., and orners. “The Development of Cooperation in the ‘Minimal 
Social Situation " Psychological Monographs: Vol. 76, No. 19 (Whole No. 538), 
1962. 19 pp. 

KEMPTHORNE, Oscar. “The Design and Analysis of Experiments with Some Refer- 
ence to Educational Research.” Research Design and Analysis: Second Annual 
Phi Delta Kappa Symposium on Educational Research. (Edited by Raymond 0. 
Collier, Jr, and Stanley M. Elam.) Bloomington, Ind.: Phi Delta Kappa 
(Eighth St. and Union Ave.), 1961. Chapter 4, pp. 97-133. 

Kenvatt, Maurice G. A Course in Multivariate Analysis. New York: Hafner Pub- 
lishing Co., 1957. 185 pp. 

KrauswEnER, Hervert J., and Cueck, Jonn. “Retention and Transfer in Children 
of Low, Average, and High Intelligence.” Journal of Educational Research 55: 
319-22; April 1962. 

Krausmeser, Herserr J.; Harris, Cuester W.; and ETHNATHIOS, ZACKARIA. “Rela- 
tionships Between Divergent Thinking Abilities and Teacher Ratings of High School 
Students.” Journal of Educational Psychology 53: 72-75; April 1962. 

Lambert, Pilip, issue editor. “The Teacher and the Machine.” Journal of 
Educational Research 55: 405-531; June-July 1962. 

Lockanp, Joan, and Sinowski, Josepn B. “Learning in Fourth and Sixth Graders 48 
a Function of Sensory Mode of Stimulus Presentation and Overt or Covert Practice- 

" d of — sd 52: 262-65; October 1961. 

owe, C. MansHarL. “The Self-Concept: Fact o ifact?” T zi ulletin 
SB: 395-36" July 1961. p ct or Artifact?” Psychological B 

Mann, Ricuamo D. “A Review of the Relationships Between Personality and Per- 

M rig s in Smell Groupa: ” Psychological Bulletin 56: 241-70; July 1959. 

Martin, Barclay. “The Assessment of Anxi + Physiological Belinvior: »asures." 
Perekotogicul Bulletin. 50: 29585. Fe d Physiological Behavioral Mea 

MEDLEY, Donato M. „and, Mirzet, llanorp E. “Measuring Classroom Behavior by 
Systematic Observation. Handbook of Research on Teaching. (Edited by N 
Cage) A project p x American Educational Research Association, a departmes 
ol he National Education Association. icago: . ; & Co. 
Chapter 6, pp. 247-328. on. Chicago: Rand McNally & Co i 

Metron, Artuur W. "The Science of Learnin ay Educationa 
Methods." Harvard Educational Review 29: put reuse d a m 

Merniristp, P. R., and orners. “The Role of Intellectual Factors in Problem Solving: 
Psychological Monographs: Vol. 76, No. 10 (Whole No. 529), 1962. 21 pp- 


MonniLL, CHARLES S. "Teaching Machines: iew." P. i lletin. 58: 
905-15. Septeaber 1961. ines: A Review." Psychological Bulleti 


Nationa SOCIETY FOR THE STUDY or Epucation. 7, ‘namics of Instructional 

gag Fifty-Ninth Yearbook, Part II. Chicago: Duet) af Chiceo Press, 1960 
pp. 

Nationa. Society FOR THE STUDY or Epucation. Individualizing Instruction. SixtY* 
First Yearbook, Part I. Chicago: University of Chicago Press, 1962. 337 pp. 

Nationa Society FOR THE STUDY oF Epucation. Child Psychology. Sixty-Second 
Yearbook, Part I. Chicago: University of Chicago Press, 1963. 550 pp. 

Perry, Water T. "Listening: Directions for Research." Elementary English 39: 
574-77; October 1962. 

Pressey, Stoney L. "Teaching Machine (and Learning Theory) Crisis.” Journal of 
Applied Psychology 47: 1-6; February 1963. 


276 


June 1963 PsvcnuoLocicarn Forces INFLUENCING CURRICULUM DECISIONS 


Smoor, Keynerit E., and Lawson, Reen. "Performance After Failure as a Function of 
_ Interpolated Activity.” Journal oj Genetic Psychology 100: 205-13; June 1962. 
SPENCE, Kennxeru W. "The Relation of Learning Theory to the Technology of Educa- 
_ tion.” Harvard Educational Review 29: 84-95; Spring 1959. , g 
SraNLEY, Jur Studying Status vs. Manipulating Variables.” Research Design 
and Analy. Second Annual Phi Delta Kappa Symposium on Educational Re- 
search. (Edited by Raymond O. Collier, Jr., and Stanley M. Elam.) Bloomington, 
Ind.: Phi Delta Kappa (Eighth St. and Union Ave.), 1961. Chapter 6, pp. 173-208. 
Srotvnow, Lawnrxcr. M. "Implications of Current Research and Future Trends. 
> Journal of Educational Research 55: 519-27; June-July 1962. . 
Strom, INcrip M. “Summary of Investigations Relating to the English Language Arts 
in Secondary Education: 1961-1962.” English Journal 52: 118-36; February 1963. i 
Torraxce, E, Pact. Guiding Creative Talent. Englewood Cliffs, N.J.: Prentice-Hall, 
1962. 278 pp. i : ” Harvard 
TEIRWOOD, Benton J. "Verbal Learning in the Educative Processes. 'arvar 
~@ucational Review 29: 107-17; Spring 1959. : T 7 
Hil’ Be: J. Statistical Principles in Experimental Design. New York: McGraw- 
ill Book Co., 1962. 672 pp. i ae : 
‘irrnock, M ar erg to Student Teaching.” Journal of Educational Psy- 
chology 53: 175-80; August 1962. 


mo 
-14 


CHAPTER V 


Curriculum Components and Organization 


RICHARD A. GIBBON 
STANLE 


', JOHN H. SANDBERG, DONALD M. CARROLL, 
N. MILLER, and NORMAN A. MILLER 


Tus CHAPTER is concerned with the evaluative research relating to cur- 
riculum components and organization that was completed during the 
period covered by this issue of the Review. Unimaginative descriptive 
reports and many studies uncritically designed and employing less than 
rigorous analytical techniques have been excluded. In preparing this 
chapter, the reviewers followed the procedures described below. 

After the standard compilations of educational research were surveyed. 
a tentative bibliography was compiled. This bibliography was supplemented 
by telephone conversations with persons actively engaged in major cur- 
riculum revision projects as a check on the literature survey. Standards 
were established by which each study in the tentative bibliography was 
judged. These standards included such criteria as the relevance of the 
problem to current curriculum development, the adequacy of the experi- 
mental design, and the appropriateness of the statistical treatment of the 
data. 

Each study in the bibliography was read in its entirety by the reviewers 
and by experimental rescarchers * on the staff of the Pennsyl vania De- 
partment of Public Instruction, who applied the standards previously 
developed. Through this procedure, most of the studies in the tentative 


bibliography were eliminated. Those that remained are reviewed in this 
chapter. 


Mathematics 


There were few significant curriculum studies in mathematics. Those 
who seek more comprehensive reviews of recent research are referred to 
the surveys by Brown and Kinsella (1960), Brown and others (1963): 
Feierabend (1960), and Kellogg and Johnson in the June 1961 REVIEW: 
It should be noted that these writers were generally unenthusiastic about 
the research they reviewed. For instance, Brown and Kinsella expresse 
reservations concerning the narrowness of the scope of, the limited dura- 
tion of, the small number of participating teachers in, and the restricte! 
areas represented by most of the studies reported to them. Feieraben 
noted that many studies were also poorly designed. 

The present chapter is confined to a few recent research efforts involv- 
ing some of the School Mathematics Study Group (SMSG) courses- 


* The assistance of Robert Hayes and Emanuel Berger is gratefully acknowledged. 
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While the studies reported here were not free from defects, they were 
significant attempts to contribute to knowledge in an important curriculum 
area. They also served to illustrate some of the difficulties inherent in 
conducting good curriculum research. 


Minnesota National Laboratory Studies 


egan a series of SMSG studies 


The Minnesota National Laboratory b 
by the Minnesota State 


in 1959, Reports have been issued periodically 
Department of Education, and these have been summarized by Rosen- 
bloom (1961, 1962). The major purpose of these studies, which involved 
thousands of students in grades 6-12, was to determine whether students 
in SMSG courses do as well as students in conventional courses on stand- 
ardized mathematics achievement tests. In addition, atlention was given 
to such matters as the effects of (a) SMSG courses on students of various 


ability levels, (b) certain teacher characteristics on learning, and (c) vari- 
ing of SMSG materials. Unfortu- 


us grouping arrangements in the teach i 
description of these studies nor 


nately, & : : 
nately, space permits neither adequate 


detailed discussion of the findings. . . 
Although several of the data are enigmatic, inconclusive, and even 


Contradictory, the following points emerge as tentative conclusions: 
(a) SMSG students do not appear to suffer on achievement tests, such as 
the Sequential Tests of Educational Progress: Mathematics (STEP) ; 
(b) SMSG materials appear to be suitable for students with wide ranges 
of ability; (c) SMSG texts do not seem lo place unusual demands on 
Students? reading ability: (d) in general, teachers can do a satisfactory job 
With SMSG materials without special training; and (e) there is no evi- 
dence that SMSG materials are better adapted to one type of student group 
than to another. 


_ Rosenbloom (1962) provided a critic 4 
ational Laboratory studies. He cited, among other things, (a) the lack 


of adequate measures of achievement for both conventional and SMSG 
mathematics, (b) the difficulty of controlling teacher variability, and 
(c) the lack of knowledge about the possible effects of one course upon 
another, when the same teacher teaches both. Probably, few would dis- 
agree with Rosenbloom’s contention that a properly designed experiment 
as not yet been effected. However, his revelation that SMSG and the 

Mnesota National Laboratory have worked out detailed plans for a 


Ive. » 
-year study is a source of encouragement. 


Additional Reference: Kraft (1962). 


al assessment of the Minnesota 


Educational Testing Service Study 


Another major effort to evaluate student achievement in SMSG courses 

Was carried out by the Educational Testing Service (ETS) during the 
) bn 
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1960-61 school year. As reported by Payette (1961). this study, which 
involved 300 classes at five grade levels (7, 9, 10. 11, 12) in 75 schools, 
was intended to provide evidence relating to the following questions: 
(a) Does the SMSG curriculum detract from student achievement with 
respect to traditional mathematical skills? (b) Does the SMSG curriculum 
result in a measurable extension of developed mathematical ability beyond 
that of conventional mathematics instruction? (c) How effectively is the 
SMSG curriculum communicated to students at various levels of scholastic 
ability? The following conclusions were derived from the data: (a) In 
general, students exposed to conventional mathematics had neither a 
pronounced nor a consistent advantage over students exposed to SMSG 
mathematics on achievement tests, such as STEP mathemati (b) SMSG 
students scored significantly higher on tests designed specifically to 
measure achievement in SMSG mathematics than did those students pur- 
suing a conventional curriculum; and (c) students of a wide range of 
ability levels appeared to profit from SMSG courses. However, Rosen- 
bloom (1962) criticized the ETS study on many of the same grounds he 
used in his evaluation of the National Laboratory studies. 


Other SMSG Research 


Two University of Minnesota doctoral dissertations provided additional 
data regarding student achievement with SMSG materials. In a well- 
designed but small-scale study, Shuff (1962) compared the achievement 
on standardized tests (STEP and Cooperative Mathematics Test [or 
Grades 7, 8, and 9) of groups of seventh and eichth grade students who 
had studied either SMSG or traditional courses Unlike the National 
Laboratory and the ETS studies, he found statistically significant differ- 
ences in favor of the students who had followed the traditional course. 

. In a companion study, Williams (1962) compared the performance of 
ninth and tenth grade students who had previously studied either SMSG 
or traditional courses. He concluded that traditional algebra texts ap- 
peared to be at least as effective as the SMSG materials in developing 
mathematical competence in grade 9, as measured by the Cooperative 


Algebra Test: Elementary Algebra Through Quadratics, and that tradi- 
tional and AMSG materials seemed equally effective in preparing students 


in grades 8 and 9 for succeeding mathematics courses, whether SMSG 
or traditional, Neither Williams nor Shuff found evidence that SMSG ma- 
terials at these grade levels are more suitable for high-ability students 
than for others. = 

The researchers whose work has been reported here would probably 
be the first to admit that the surface has barely been touched insofar 3$ 
evaluation of the SMSG courses is concerned. The studies carried out $° 
far actually are not evaluations of the SMSG courses. They are, for the 
most part, evaluations of student achievement on traditional standardized 
tests after using SMSG materials. It is necessary, as Rosenbloom (1962) 
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states, that decisions be made regarding the objectives of mathematics 
education and that adequate instruments for measuring achievement of 
these objectives be developed. Only when this has been done can research 
determine which approaches to the teaching of mathematics are the most 
effective. 


Science? 


This section will review four carefully selected studies which investigated 
pertinent problems related to national efforts to improve the science cur- 
riculum. The paucity of well-designed research concerned with significant 
curriculum problems revealed a glaring deficiency in research efforts over 
the past three years, even after allowance was made for the fact that some 
of the national curriculum projects have only recently been finished or are 


in their final stages of development. 


Physics 


e Physical Science Study Committee 
(PSSC) course in physics has opened new vistas for curriculum develop- 
ment and has provided a stimulus toward collaborative revision which is 
eing felt in most subject fields today. More research must be completed 
efore it can be known whether this course will achieve its objectives. No 
Significant study was found that attempted to measure the eflectiveness of 
the PSSC course in terms of its objectives. M : 

A well-planned and meticulously executed descriptive study by Trow- 
bridge (1961) is helpful in comparing the objectives of the PSSC course 
With those of the traditional physics course. A further analysis was made 
of the PSSC materials to determine their relevancy to the stated course 
Objectives, 

The investigator cited 72 objectives for high school physics based on a 
thorough ras of all the PSSC materials and interviews with the course 
author, A review of traditional physics textbooks and supplementary 
Materials and a survey of the literature of the past 10 years ient 
Source materials for the traditional objectives. Of the 72 objectives thus 


Secured, 17 were unique to the PSSC course, 17 to the traditional course, 
and 3g. tives. The reliability coefficient was satisfactory 


n 8 were joint objec ren emm 
?r the procedure followed in determining the objectives. P 
e unique PSSC objectives were related to such factors as the unifie 

Te ot the pen the emphasis on basic science rather than on applied 


nce, the reduction in the number of topics covered. and the oppa 
?n the laboratory as the focal point of learning. The unique traditiona 
a, Jectives emphasized consumer knowledge, the scientific method, the 
*Pplication of physics to technology, and laboratory work for the purpose 


281 


Since its inception in 1956, th 


Natu 
Scie 


Review OF EDUCATIONAL RESEARCII Fol. X XW. No. 3 


of verifying principles. Joint objectives were related to such areas as ap- 
preciation and interest, teaching techniques, and concept development. 

A questionnaire was sent to random samples of 100 PSSC teachers 
and 100 traditional teachers to determine the degree to which these teachers 
subscribed to the 72 objectives in belief and practice. The return was 85 
percent and 76 percent of the recipients of the questionnaire in the respec- 
tive groups. (Fourteen returns were analyzed separately to avoid con- 
tamination of the data by teachers who had attended PSSC institutes but 
were teaching traditional courses.) Six selected conclusions of this study are 
as follows: 


1. PSSC teachers did show significantly higher agreement with belief 
in the unique PSSC objectives than did traditional teachers in 64.5 percent 
of the cases, but significantly higher practice in only 47 percent of the 
cases. PSSC teachers did not strongly reject the unique traditional objec- 
tives when teaching practice was considered. 

2. Traditional physics teachers showed a tendency to subscribe to the 
unique PSSC and joint objectives in addition to the unique traditional 
objectives. 

3. PSSC materials adhered to the PSSC objectives and to the joint ob- 
jectives on such factors as (a) presentation of subject matter and (b) 
learning and teaching methods, on the basis of the percentage of these 
objectives that had rating values above the mean rating salis for all 
objectives. © 

4. Significant correlations indicated that both PSSC and traditional 
teachers strongly adhered to the unique objectives of their courses with 
respect both to agreement and to practice. 

5. PSSC teachers and traditional teachers differed significantly with 
respect both to agreement and to practice in their responses to the 72 
objectives. 

6. Unique traditional objectives received little attention in the PSSC 
materials (mean rating value of 1.20 compared with the over-all mean 


of 2.62). 


The minor conclusions of this study, which were concerned with the 
responses of the PSSC and of the traditional teachers with respect to size 
of school, teaching experience, and class size, are of doubtful value, because 


the data merely describe possible relationships. Moreover, the study was 
not designed to control relevant variables, 


Chemistry 


During the 1961-62 school year, Heath (1962a, b) completed two studies 
that compared the achievement of students takine newly developed chemis- 
try courses with that of students taking traditional chemistry courses. The 
new courses were those developed by the Chemical Education Material 


Study (CHEM) and by the Chemical Bond Approach Project (CBAP). 
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bod BEND and design of the two studies were similar. Control 
chers a mly selected from lists of secondary schools. The ex- 
perimental and control teachers were categorized in a three-variable 
stratification table: geographic region, size of community, and sex of 
teacher. The experimental and control groups were comparable on most 
relevant demographic characteristics. Data were analyzed by the analysis 
of covariance. i 

Significant differences on the Cooperative Chemistry Test were found 
between the CHEM classes (adjusted mean, 31.9) and the traditional 
37.7). with intelligence. as measured by the Co- 
ege Ability Tests (SCAT), controlled. This find- 
ause of the difference in content of the 


classes (adjusted mean. 
operative School and Coll: 
ing is not necessarily undesirable bec 
CHEM course. 


The comparison of the differences between the two groups on the CHEM 


final examination mean scores, with the effects of SCAT removed, cannot 


be stated with precision because one of the basic assumptions for 
adjusted mean for the CHEM group was 
13.2, However, the investigator stated that 
the difference between the adjusted means was “quite meaningful.” He 
noted, from an analysis of the regression lines, that the CHEM group’s 
advantage over the traditional group became increasingly larger in the 

or concluded finally that the CHEM 


upper scholastic range. The investigat th 
group demonstrated considerable skill on both the traditional and the 


CHEM final examinations. . 
The results of the Chemical Bond Approach Project (CBAP) study were 
dy. The traditional chemistry students 


similar to those of the CHEM stu 
vel of performance on the Coopera- 


achieved a significantly higher mean le l 
tive Chemistry Test than did the CBAP classes when the effects of SCAT 


were controlled. The respective adjusted means were 38.1 and 34.2. The 
Comparison of the two groups on the CBAP final examination was in the 
same direction as that reported above for the CHEM group on the CHEM 
final examination. In this case, the adjusted mean for the CBAP group was 
25.7; for the traditional group: 15.6. . . 
Heath's candor in pointing out the limited value of his studies to other 
Problems of curriculum development is commendable. Fewer studies are 
needed that deal with global "how much" questions, and more refined 
Studies are needed that deal with the qualitative “in what way” questions. 
uch a development would be most refreshing as a leaven for the many 
mediocre doctoral dissertations read. for possible inclusion in this chapter. 
DeRose (1962) was concerned with the possible transfer, between 
AP ciiemistry and PSSC physics, of attributes associated with the 
Process dimension of the two courses: Are knowledge, attitudes, and 
methods acquired in one course transferable to the other, as compared with 
the responses of students who were taking one of the new courses for the 


first time? 


covariance was not met. The 
25.7; for the traditional group, 
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Because of the relative recency of the curriculums and the particular 
hypotheses tested in this study, the sample selection procedures were too 
involved to be adequately summarized; however, the reviewers accepted 
the procedures used as appropriate to the conditions in existence at the 
time the study was done. The CBAP group consisted of 179 students; the 
PSSC group, 91. The sample was drawn from nine schools (six public, 
two university-related, and one private institution) in nine states. The 
students were similar on such factors as age, sex, and previous mathemati- 
cal and scientific experience. Intelligence was contolled by the covariance 
technique. Approximately one-half of the students in the PSSC and about 
one-half of the students in the CBAP groups had completed one year .of 
traditional chemistry and one year of traditional physics, respectively: the 
other half of the PSSC group had completed CBAP chemistry, and the 
other half of the CBAP group had completed PSSC physics. Appropriate 
tests developed by the ETS were used as the criterion measures. 

No significant differences were found in PSSC physics between students 
who had studied CBAP chemistry and students who had studied traditional 
chemistry. No significant differences were found in CBAP chemistry be- 
tween students who had studied PSSC physics and students who had studied 
traditional physics. 

Four factors should be considered in future studies that deal with prob- 
lems similar to those investigated here: (a) realization of adequate sam- 
pling; (b) validation of the criterion measures; (c) establishment of valid- 
ity of comparisons on the process dimension, as judged on such factors as 
objectives and course materials; and (d) control of teacher variables 10 
insure that teaching is in terms of the objectives of the particular course. 


English 


There has been much ferment in the field of English | 
Numerous articles discussing the confusion over desirable content and 
effective teaching techniques have appeared in professional journals. Sig 
nificant research studies, however, have been sparse. This is, undoubtedly; 
why Project English was instituted by the U.S. Office of Education. Stein- 
berg's (1962) report of the first Project English conference in May 1962 
gave some idea of the confusion that exists. The conference participants 
recommended that the following questions be studied: What grammar 
should be taught? What effect, if any, does the teaching of grammar have 


on the student's ability to read, write, and speak? How can teachers best 
motivate students of English? 


anguage instruction. 


Theory of Language Instruction 


Russell (1962) also discussed the confusion and pointed out the need 
for a research point of view as opposed to the opinion-oriented and careless 
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methods used by many critics and by several investigators of English 
teaching. Rinker (1962) agreed that confusion exists and listed several 
priorities in the study of English teaching. Rinker's priorities as well as 
the questions listed by Steinberg, however. serve to illustrate Russell's 
thesis that much of what is said about English teaching is based on opin- 
ion rather than on new research evidence. 

The confusion is compounded by arguments between traditional gram- 
marians and structural linguists. Some research studies failed to show a 
significant relationship between knowledge of traditional grammar and 
ability in reading, speaking, and writing. Structuralists have seized on 
this lack of research support for traditional grammar to argue that the 
linguistic approach, if substituted for traditional grammar in schools, might 
improve pupil performance in reading, speaking, and writing. "M 

Traditionalists and structuralists do not agree on the same major issues 
within their respective groups. Thomas (1962) reported that traditional 
English grammar textbooks do not agree on the number and definitions of 
Parts of speech. Thomas advocated a unification and simplification that 
may allow the traditionalists and the structuralists to join forces. 


Review of Studies 


nent literature, Alwin (1958) attempted 
at have been identified by the major 
led four devices or principles of 
Structural grammar: (a) word order, (b) word forms, (c) function words, 
and (d) intonation, Using procedures similar to Alwin s, 0 Donnell (1961) 
attempted to identify “essential structural pinapa : O0 Mox ER 
lwin's agreeme ajor principles lends authority to their work. 
s agreement on the major F 100 pages E d asal 


But the very fact that Alwin spent approximately nd 1 
40 pages to develop and defend these essential structural principles gives 


an indication of the embryonic nature of the principles. In more mature 
fields, such principles would require no extensive justification. 

Careful research into the relative merits of structural linguistic versus 
traditional grammar approaches has barely begun. (0) Donnell (1961) also 
analyzed the relationship between the reading comprehension of pupils 
and their awareness of the structural patterns of English. No standardized 
tests designed to measure the understanding of the patterns of English 
structure were available. Much of O'Donnell's effort, therefore, was directed 
to the development of a 50-item test'to measure recognition of structural 
relationships in English. Test reliability was established by standard 
Statistical procedures, including four administrations of the test to a total 
of 302 college and high school students. Each administration of the test 
Was followed by a statistical analysis to determine the level of difficulty and 
the discriminative value of each item. The test items were accordingly 
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revised. The final form of the test was administered to 106 subjects in the 
senior class of Bowling Green High School, Bowling Green, Kentucky. The 
results yielded a Spearman-Brown split-half reliability coeflicient of .88 
and a Kuder-Richardson interitem consistency coefficient of .86. The lack 
of randomness in selection of subjects limits the applicability of the study. 

This group of 106 subjects also took Test Cl: Reading Comprehension, 
the Jowa Grammar Information Test, and the Otis Quick-Scoring Mental 
Ability Tests. Various Pearson product-moment correlations were computed 
from these data. The following major findings resulted: (a) a correlation 
coefficient of .75 between scores on the grammar test and scores on the 
structure test, (b) almost identical coefficients of .16 and .41 between 
scores on the grammar and structure tests and scores on the reading com- 
prehension test, (c) a coefficient of .90 between scores on the grammar 
test and vocabulary scores, and (d) a coefficient of .46 between scores 
on the structure test and vocabulary scores, The correlation coefficients 
were significant beyond the .05 level. 

This relatively well-designed and well-conducted study indicated that 
awareness of structural elements of English and knowledge of the elements 
of traditional grammar had a significant correlation with reading compre 
hension. The high correlation of knowledge of grammar with vocabulary 
deserves further investigation. 

No instruction in linguistic analysis of English occurred in this study. 
The test attempted to measure awareness of structural patterns with sub- 
jects who had received traditional instruction in English grammar. Still 
to be answered are such questions as these: What would be the effect of 
varying amounts of instruction in linguistic analysis of English on reading 
comprehension and speaking and writing patterns? What is the relation- 
ship between writing and speaking patterns and awareness of structura 
English relationships? 

It appears that some of those teachers and research workers concerned 
with English language instruction must think through the assumptions 0D 
which English language instruction is based. If the major objectives of 
language instruction are related to appreciating the complexity of languag® 
to understanding the structural relationships of language, or to knowing its 
uses, such a curriculum may then be designed and evaluated. But, if 
instruction in English language is to lead to improved reading, speaking: 
and writing ability, much careful experimentation must be undertaken. 
Such investigations will examine alternative curriculum content along with 
alternative instructional methods and techniques in an effort to determine 
whether these abilities can, in fact, be improved significantly by instruction. 

On the basis of the research evidence and journal articles reviewed, there 
is little reason to assume that current objectives for English languas® 
instruction are sound or attainable. Moreover, it cannot be assumed either 
that current curriculum content adds to these objectives or that current 

instructional techniques are efficient. 
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Foreign Languages 


Menak = grapas — the teaching of ais languages 
vetns sabe jen try be: 100 * 1e program of teaching foreign lan- 
ee e elementary school is referred to as FLES. A study by the 
a Public Schools (1962) examined the effect of modern language 

udy in the elementary school on achievement in high school. Somerville 
High School received. pupils from elementary schools which offered a 
FLES program in grades 3-8 and from elementary schools which did not. 
ad comparing the over-all high school grade point average of 1,530 
"LES and non-FLES pupils, the investigators concluded that the FLES pro- 
gram did not have an adverse effect on the achievement of pupils of equal 
general ability. 

A comparison of 343 pupils who elected 
Suages in high school revealed that the grade point average in languages 
studied was not significantly different when FLES and non-FLES pupils of 
the same general ability were compared. The FLES pupils were, however, 
accelerated one year in French and in Spanish. 

Lopato’s (1961) study, which was conducted at the third grade level 
With two FLES and two non-FLES classes, attempted to determine whether 
à FLES program had an adverse effect upon pupil achievement in other 
e subjects. The experimental and control classes were not significantly 

ifferent in intelligence or social class b 
Nes cw achievement tests in the 
eee both groups at the beginning ane ^^. | 
‘NO significant difference in mean gain in achievement was found in favor 
of the control groups although a significant difference in arithmetic 
achievement was found in. favor of the experimental groups. Caution should 
be exercised in interpreting these findings, in view of the possible presence 
of both teacher variability and the Hawthorne effect. i 
i he investigators concluded that children in a regular third grade class 
he furnish evidence of satisfactory versational French 
of ih they have been taught by qua 
Vu year, each pupil was rated : 
Mech by a panel of judges. Re d p 

vement was considered established if there was 
agreement among the judges on each rating. 


a " was also concluded that pronunciation 
"ldren were superior to the results generally 


"i£h school students after an equivalent numbe 
1l evident. 


alth d : 
ough wide variations were SU 


1 the third level of foreign lan- 


ackground. 
basic skill subjects were admin- 


d end of the experimental period. 


achievement in con 
lified classroom teachers. At the end 


n comprehension, fluency, and pro- 
bility of the scale for rating French 
at least 67 percent 


and fluency of the third grade 
obtained with beginning 
r of hours of instruction, 


Social Studies 
which hopefully may give direction 


Sever " " 
veral major developmental studies, ] ; 3 } 
] studies curriculum, are in their 


ot 3 
Bor development of a modern socia 
ning stages, 
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One study should be noted because it bears upon an important question 
that has been raised relative to the teaching of chronology in the elemen- 
tary grades. Arnsdorf (1959) conducted an experiment with 563 sixth 
grade subjects to determine whether children of this level might be helped 
in their understanding of several measures of chronology through an 
instructional program which stresses activities dealing with chronology. 

The investigator concluded that children at the sixth grade level might 
be helped in their understanding of several measures of chronology through 
an instructional program. A further conclusion was that the upper ability 
group appeared to be most successful in its performance on measures of 
chronology. 


Additional References: Fraser (1962) : School Review (1962). 


Curriculum: Organization 


During the late 1950’s and early 1960's, certain curriculum organizalions 
were suggested which were basically different from those used in previous 
decades. Trump (1960), among others, emphasized such techniques as 
team teaching, large- and small-group instruction, and flexible scheduling: 
Goodlad and Anderson (1959) related continuous pupil progress to ele- 
mentary school organization. These organizations were discussed with 
increasing frequency in the literature during the period from 1959 to 1963. 
Most of the literature, however, was not based on systematic research 
studies which evaluated the ideas presented. Two well-designed studies 


have been selected for review. Unfortunately, both apply to the secondary 
school, 


Team Teaching 


Taffel (1961) evaluated a team m 
high school physics to academically 
the achievement of the experimental c 
that of the control classes taught by th 
The study also analyzed the attitude 
team method. 


oi M Ji i 7 H H - 5 

Fifty five pairs of students were obtained by matching age, sex, grade; 
science-mathematics scores, intelligence, and scores on the Dunning Physics 
Test. One student from each pair was placed in the experimental class, a” 
one student from each pair was placed in a corresponding control class. 


There were no significant differences between the two greups on the 
criteria used in matching. 


ethod of teaching a first course. in 
talented students, through comparing 
lasses taught by the team method with 
1e conventional single teacher method. 
s of students and teachers toward the 


A teaching team was formed consisting of the investigator, who was a” 
experienced physics teacher and science supervisor, and two science 
teachers, who were comparable to each other in experience and education: 
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The investie. 
Titi nh hip em gun fe emt eet 
conventional classes. Eacl of s remainder of tip mam program A tas 
aneo iones ^ x Pach of the team teachers also taught one of the 
ud. soie ue owing the traditional pattern of the school. Time, 
Sonys iar : ie aa coverage, and scheduling were the same for both 
awe. p r the large group lectures received by the experimental 
"e woran measures were the Dunning Physics Test, the Bronx High 
ferens ee midyear examination in physics, and the New York State 
differences veh ie in physics. The investigator reported no significant 
The reenit d ement between the experimental and control groups. 
bah aif * a gear eem however, indicated favorable attitudes by 
etde T s an teachers toward the team teaching approach. Taffel 
hat team teaching is a practical alternative to the conventional 


method of teaching physics to academically talented students. 

at team teaching is characterized by 

ruction in addition to other 
d to planning was minimal, 

A question may be raised, 
in fact, an evaluation of 

valuation of large-group 


nm reviewers believe th 1 

A n iii planning and by small-group inst 

aid am pei is In this study, the time devote 

therefore na group instruction was reported. 

team on ping: whether the study was, 

itinere ling, or, what appears more likely, an ev 
n. 


Trac i 
ack Organization 


* McCown (1960) evaluated the influence of grouping practices on the 
ac. " $ H i 

psa of students in the honors and in basic curriculum tracks of 
a s H H H H 1 

Senior high schools in the District of Columbia. That portion of the 


Study M : . 
ly concerning the honors track is reviewed here. 
track curriculum in the tenth 


ea eria of Columbia initiated a four A l 
iles st t Ssptembas of 1956. McCown compared 4f TUE ain gradu- 
Who w 1959 with a matched group of graduates from the class of 1958 
oma heterogeneously grouped. Groups were matched on intelligence, 
equal ^ level, and arithmetic achievement. In the matching procedure, 
stud numbers of honore track students and of heterogencously grouped 

ents were taken from six high schools to form the experimental and 


le Contro] groups. 

Ne College Entrance Examination Board Scholastic Aptitude Test and 
owa Tests of Educational Development in English, social studies, 
* Science, and mathematics Were given to the members of me) 
c control groups during their senior year. The i ences "d a 

were significant in favor of the honors track group. Other data 
s, class grades, 


ere 
rese : : as 
Presented pertaining to Merit Scholarship semifinalist 

ndings based on these data were not 


Dd sty 
Clteq, ident and teacher opinion. The fi e i c 
° ecause they were not related to the stated problems investigated, in 
j ere not treated statistically, 


Stane fa " 
Ce of student and teacher opinion, or W 
989 


Natur. 
Ment, 
Meas 
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in the case of class grades. Findings based on the Merit semifinalists are 
not reported, because the investigator included all scholarship holders in 
1959, whether or not they had taken the honors track program: therefore, 
the increase in Merit semifinalists could not necessarily be attributed to 
the influence of the honors track program. 


Recommendations 


One conclusion must be stated briefly based on an analysis of studies 
reviewed for this chapter: Many doctorates are being awarded for poorly 
designed research about insignificant problems. That so few doctoral 
studies, of those read by the authors, were included in this chapter is 
eloquent testimony to this statement. This shocking lack of discrimination 
on the part of some faculties not only misleads the degree candidate but 
also retards the development of the profession by dulling its intellectual 
cutting edge. 

Since few studies, doctoral or otherwise, are related to a theoretical 
framework of learning or teaching or to a comprehensive curriculum 
design, the separate studies, when viewed as a whole, appear as sounding 
brass and tinkling cymbals. Too few studies are replicated; too few studies 
have an end beyond themselves. One recommendation might be for more 
graduate schools to plan a series of doctoral studies dealing with one 
aspect of a significant problem. The general lack of a true residence T€ 
quirement for doctoral students probably is related to the poor quality ? 
the research surveyed. 

This journal could perform a vital service by rigorously selecting fewer 
studies for review, by analyzing each study in depth, and by candidly 
reporting its strengths and weaknesses. Exclusion from the Review woul 
increasingly become the vehicle for vigorous internal professional criticism. 
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CHAPTER VI 


Teaching 


ROLLAND CALLAWAY and GERALD T. GLEASON 


Is tue June 1900 Review, which was devoted to curriculum planning 
and development, Bellack and Huebner presented a rationale for the in- 
clusion of a chapter on teaching. They indicated that, although teaching 
is closely related to curriculum, little progress had been made in the 
direction of development of a theory of teaching as a vehicle for the im- 
plementation of curriculum. It is encouraging to note that significant 
progress has been made in this direction during the past few years. A 
number of studies included in the section on analysis of the teaching act 
report the development of theoretical constructs and instruments for the 
description and analysis of teaching. 

Other studies have been concerned with an examination of the role of 
Personality as well as with problems in teacher recruitment, selection, and 
effectiveness. Closely related are the studies on the characteristics of 
teachers in the broader personal, social, intellectual. and cultural aspects. 

The reader who is interested in a comprehensive analysis of this topic 
Will be aided considerably by three recent major publications. The Hand- 
book of Research on Teaching (Gage, 1963) presents a scholarly and 
critical analysis of the literature, including sections on theoretical orienta- 
tion, methodology of research. on teaching, major variables, teacher 
Personality and characteristics. and methods of teaching various grade 
levels and subject areas, This volume is likely to be an invaluable source 
Or many years. An entire issue of the Journal of Experimental Education 
(Barr, 1961) summarized much of the research completed within the past 
decade on prediction of teacher success. Highly significant was Ryans’ 
(1960) comprehensive study on teacher characteristics and personality, 
Each of these sources, which represent comprehensive and authoritative 
Summaries and svntheses of research on teaching. should enable researchers 
and curriculum workers to achieve greater understanding and to develop 
Penetrating insights into the nature of te 


achers and teaching. 


Analysis of the. Teaching Act 

igned to develop methods and instruments for 
ade encouraging progress during the 
and Mitzel (1963) summarized the 
behavior based on systematic 
either lack of, or multi- 
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plicity of, a concept of teaching: by problems of instrumentation, design. 
and replication; and by the changing public and professional image of 
the ideal teacher. 

Flanders (1961) continued to develop his interaction analysis, an obser- 
vational technique which classifies verbal behavior into categories that have 
been used to analyze the degree and effects of direct and. indirect teacher 
influence as well as a range of teacher and student behavior. Through the 
use of an ingenious analysis matrix, Flanders was able to specify and 
quantify a number of significant acts of the teacher that conceivably 
influence achievement and adjustment of the student. Flanders also used 
the method of interaction analysis as an in-service tool for helping teachers 
analyze and change their classroom behavior. 

As part of a study on assessment of quality of teaching, Hughes (1902) 
compiled and analyzed verbatim records of a range of teachers judged 
successful and less successful. She identified 31 specific functions under 
seven broad classifications: control, imposition. facilitation, development 
of content, personal response, positive affectivity, and negative affectivily- 
These categories and functions, which were used to describe and clas: fy 
verbal behavior in the classroom, were analyzed in terms of a number of 
significant aspects of content and curriculum implications. 

Medley and Mitzel (1958) continued development of an instrument 
called the Observation Schedule and Record. Originally designed to pro 
vide quantitative data on behavior of beginning teachers, the instrument 
has been revised and refined to increase observer accuracy, increase efli- 
ciency of use, and separate the process of observing from the process 9 
scoring. Factor analysis of the data yielded largest loadings on the factor? 
of emotional climate, verbal emphasis, and social organization. . 

Smith (1961) focused directly upon the logical structure of a teacher * 
verbal behavior. Through separating the teaching act from learning aní 
through analyzing the recordings of verbal intessctien in high schoo 
classes, he identified two categories of teacher verbal behavior—perform- 
tive and expressive. Wright (1959) developed an instrument for analysis 
of the verbal interaction of mathematics teachers, and Brown (1960) used 
the instrument designed by Withall (1951) to study the behavior of third 
grade teachers, á 

The problems associated with the development of instruments for 
measurement of classroom behavior were summarized by Medley anc 
Mitzel (1963). At first glance. they would appear to be insurmountable. 
The reliability and validity of observations as a means of assessing teach- 
ing effectiveness were studied by Groff (1961). who found that college 
supervisors agreed with the student teacher's self-imaze only slightly more 
than one-half of the time. Veldman, Peck, and McGuire (1961) have 


suggested that the individual construct Kwdiatns of judges or raters ? 

human behavior are significant variables in the research. process. 
However. the promising insights these and other researchers have pr? 

vided into the nature of the teaching act represent real progress towa!" 
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an improved understanding of the task of teaching and the eventual 
development of a theory of instruction. 


The Teacher and Teaching Effectiveness 


Personality 


Much of the research on teaching has been based on the premise that 
personality. These 


rsonality pattern or 
diction of teaching 


teaching effectiveness is a function of the teacher's 
studies have been concerned either with the total pe 
with specific personality traits as related to the pre 
success or effectiveness. It should be recognized that any study concerned 
with the personality of the leacher is generally based on a stated or 
implied concept or on an operational definition of teaching. In this con- 
text, Barr (1960) asked whether the techniques of teaching that grow 
out of learning theory are encompassed by the behavioral aspects of 
personality. or whether the behaviors found in the techniques of teaching 
are something different. Jensen (1961) suggested that there are a limited 
number of nonoverlapping critical qualities and that each quality is 
essential. Thus, the good teacher would possess at least one superior 
quality and would not fall below average on any others considered 


necessary. 

A major problem in research on teaching has been the development of 
instruments or scales that measure personality patterns or traits and that 
might be correlated with teaching effectiveness. For example, Sheldon 
(1959) indicated that “fakability” should be considered as well as relia- 
bility and validity in the development and use of teacher-selection inven- 


tories, 


Among the most popular instruments that have been utilized are the 


Minnesota Teacher Attitude Inventory (MTAI) and the Guilford-Zimmer- 
man Temperament Survey (GZTS). Popham and Trimble (1960) hy- 
Pothesized that the MTAI discriminates between superior and inferior 
teachers and concluded that the MTAI can be used as an indication of 
teachers’ general competencies as well as an index of the types of social 
atmosphere that teachers maintain in the classroom, In another study, 
Cook (1962) compared MTAI and GZTS trait correlations for a group of 
beginning teachers, student teachers, and in-service teachers. Gruber (1960) 
also utilized the MTAI and the GZTS in a study concerned with the 
Personality traits and attitudes of undergraduate majors in, and teachers 
of, physical education. Gowan (1960) attempted to develop a screening 


scale related to the GZTS as a prognosis of teaching effectiveness. 
In the use of another instrument, the Taylor Manifest Anxiety Scale. 


utton (1962) concluded that student teachers can be differentiated in 
relation to their ability to be influenced and to their attitudinal changes 
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toward youth. Lang (1900) used the Edwards Personal Preference Wo 
ule and the Lang Scale of. Motives for Teaching in his study of tent ler 
selection. Scandrette (1962) also employed the Edwards Personal Eroe 
ence Schedule to determine whether women students preparing to Leno 
on the elementary level have different need patterns from students prepar- 
ing to teach on the secondary level. Garrison and Scott (1961) used the 
same scale in a similar study. 

The Minnesota Multiphasic Personality Inventory (MMPI) has also 
been administered in a number of studies concerned with teaching effec- 
tiveness. Flanagan (1961) compared scores on the MMPI with teaching 
success indicated by supervisory ratings. Utilizing the MMPI and the 
group Rorschach scale, Cole (1961) found that they correlated signifi- 
cantly with success in teaching as rated by a classroom observer. A 

An example of the self-report approach to research on teaching Was 
Hatfield’s (1961) study, which attempted to determine the relationship 
between self-concept and successful performance in student teaching, 
Also in this category was Peck's (1960) investigation in which he obtained 
personality analyses from sentence completion data. . 

Another approach to research on teaching is concerned with the study 
of specific personality traits. For example, Dixon and Morse (1961 ) 
found that teachers who have high emphatic capacity, as indicated by 
pupils, are seen also as better teachers. Reed (1961) summarized four 
independent studies concerned with teacher warmth and competency. In 
another investigation concerned with a specific personality trail, Doge 
(1961) studied the egocentric teacher and altruistic teacher. Powell an¢ 


: esi : : somen 
Ferraro (1960) examined sources of tension in married and single wome 
teachers. 


Characteristics 


In general, the studies concerned with characteristics of teachers repre 
sent attempts to relate either a specific attribute or a variety of charac- 
teristics of teachers to teaching effectiveness as measured by observe” 
ratings or pupil gain. Perhaps the traditional approach to the investiga" 
tion of teacher characteristics has been the development of lengthy a” 
comprehensive lists of specific qualities that are judged to be related to 
teaching effectiveness. The study of Cassel and Johns (1960), in which 
they examined the critical characteristics of effective teachers as assess° 
by school administrators, is an example. Tyler (1960) noted the almost 
endless number of characteristics as one of the problems in this approac 

The work of Ryans (1960), as described in Characteristics of Teachers: 
represents one of the most comprehensive studies in this area. Ryans 
(1961b) also reported on a study of relationships between an observer £ 
assessment of classes of pupils and teachers and the classroom behavior E 
pupils and teachers. In another study, Ryans (1961a) related certai! 
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teacher characteristics, as indicated by scores on a self-report inventory, 
to pupil behavior in the classroom and to pupil-teacher relationships. : 

In another comprehensive study in this category. Heil and Washburne 
(1962) hypothesized that there was a relationship between the changes 
that take place in children under the influence of teachers and the types 
of teachers and children involved. They found that there are identifiable 
types of teachers and that different types of teachers have different effects 
on the children they teach. ) 

One of the persistent questions related to this category concerned the 
authoritarian or democratic behavior of the teacher. After reviewing 49 
experimental studies in which authoritarian leadership was compared with 
democratic leadership, Anderson (1959) concluded that the evidence did 
not demonstrate that authoritarian or democratic leadership was con- 
sistently associated with higher productivity. In relation to this question, 
Johnston (1959) studied personality differences between autocratic and 
democratic teachers and concluded that democratic teachers had more 
basic social-emotional security. In another study, Christensen (1960) was 
concerned with the relationships between permissiveness and warmth of 
teachers and affect-need and achievement of pupils. Medley and Mitzel 
(1959) also considered behavioral correlates of teacher effectiveness. 

A number of studies have been reported that were concerned with 
students’ assessments of teacher behavior and effectiveness. At the college 
level, Drayer (1961) investigated students’ views of the qualifications of 
their teachers, Likewise, at the college level, Cosgrove (1959) developed 
à technique for evaluating the effectiveness of the instructor by the student. 

Included in the studies of teacher characteristics are those that compare 
the characteristics of teachers with those of a general population. For 
€xample, Mitzel and Dubnick (1961) reported on a persisting problem— 
the scholastic ability of prospective teachers as compared with that of 
iberal arts students. They indicated that perhaps teachers are not repre- 
Sentative of the general scholastic ability available. . u 

Finally, in relation to teacher characteristics, are those studies pertaining 
to general background, choice of teaching as a career, selection of teaching 
area or level and persistence in teaching. In this category, Rabinowitz 
and Crawford (1960) made a comprehensive investigation of the teach- 
Mg careers of graduates of the municipal colleges of New York City 

teaching and persisting m teaching 


ax ree tering 
ne factors related to entero i 
cmn Kleinman (1960) examined the adjustment 


heir first year of teaching in relation 
to them as they selected their first 
to selection of teaching level, Fox (1961) 
; he choice of elementary or secondary teach- 
mg, and Jones and Gottfried (1962) attempted to identify the interrela- 
'ionships between preferences for teaching various types of exceptional 
children, I 
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Conclusion 


Review of the research on teaching during the past three years indicates 
a number of promising developments growing out of the work of several 
researchers on the nature of the teaching act. These studies might be 
described as micromatic approaches, because they attempt to identify and 
classify specific observed behaviors and use such classifications in an 
atlempt to develop a theoretical base for the amorphous act of teaching. 
Similarly, the studies reported on teaching effectiveriess and characteristics 
of teachers typically utilize an approach which assumes that such charac- 
teristics, while obviously interrelated. can be identified and isolated, at 
least statistically. Identification and cultivation of those characteristics 
that tend to be related to successful teaching can only lead in the direction 
of more successful predictions of success in teaching as well as appropriate 
adaptation of preparation programs. 

The tasks remaining are frightening in their complexity. However, 
through continued research designed to move us from empirical evidence 
toward postulation of theoretical constructs and toward the systematic 
formulation of theories of instruction from which deductions may be 
tested, the profession will be able to define its role and responsibility 
effectively. From this perspective, signifie 
shape—beginnings which can be 
educator, 


a a taking 
ant beginnings seem to be taki 4 

-ientec 
a source of hope to the research-orient 
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CHAPTER VII 


Curriculum and Communications Media 


CURTIS PAUL RAMSEY 


Qoxsisicinos, curriculum, and teaching are words filled with many 
meanings and are yet imprecise. When effective teaching is equated to 
a successful communicative act, how does one know for sure that one has 
communicated? For that matter, how does one know for sure that GB 
has taught any particular thing? 

In the June 1961 issue of the REVIEW, Shafer discussed strategies in 
communications research, limited status of research methodologies and 
techniques, and effects of mass communication on children. The implica- 


tions of communications media for curriculum are obviously great. But 
he effect of the media on 


what research evidence exists concerning t 
scems seriously concerned 


curriculum? (Incidentally, almost no one 
about the effect that curriculum might have on communications media!) 


As yet, criteria for the determination of the degree of success of trans- 
mission of meaning from one mind to another are nebulous and inadequate 
More is known about what contributes to failure in communication and 
in teaching than about what contributes to success. Theoretical pro- 
posals are far in advance of empirical research findings. Concerning this 
point, Carpenter (1962) warned that it is erroneous to conceive of 
communication processes and content as “transmissions of energy” that 
will lead directly to a predetermined response int 


he individual who has 
been stimulated, regardless of the nature of the mediation processes 
involved. 


Technology of Teaching 


igw devoted to a consideration of 
962 issue of the REVIEW concerned 
nderlying curriculum and 


instructional media were noted. Finn (1960) insisted that an emerging 
technology of teaching must be reckoned with today. Moreover, he stressed 


that the implications of parallel but opposite trends in mass and indi- 
create a riptide of confusion in the curriculum, 


vidualized instruction may à TUE = 
Of even greater danger is the possibility that the design and control of the 
f legitimate educational authori- 


curriculun: may be taken from the hands o 
ties through deliberate short-circuiting by foundations and other agencies. 


Articles and admonitions about the role of communications media and 


their impact on curriculum are many. However, an entire school of media 
for research purposes—that the individual learner 


research has denied— 
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is a mediator in the communications paradigm. RIS uS 
haviorism, which has not addressed itself to the dynamics of ind ME 
perceptions of mass media stimulation, has instead focused myopica ly - 
a small-step, atomistic presentation and on response modules. Recent 5 
Jordan (1963), who challenged even the Socratic premise of programe 
instruction, found the method far inferior to its presumed model. 


Merchandising and Research 


In short, there is more sound and fury in the field of communications 
and curriculum than there is substance. Whether one adheres to the posi- 
tions taken by Finn (1960), by Ramo (1960), or by Bushnell and Cogs- 
well (1961), the impact of discussion is not so much a million-pound 
thrust that can be attributed to an inexorable technological advancement 
as it is a million-dollar thrust built upon a capital investment. 

Finn and Perrin (1962). who documented the tremendous growth of the 
field of programed instruction and teaching machines, found that there 
were 104 companies in various stages of preparation or production of 
devices and programs in 1962. Perhaps as few as 11 of these companies 
existed a decade earlier as publishers of films. Part of the enthusiasm and 
advocacy of new technology for teaching appears to he self-generating 
and self-reinforcing, Possibly to furnish some degrce of control upon the 
runaway trend, Soles (1963) reported the development of criteria useful 
to distinguish between valuable educational services and commercial 
quackery, 

However, the programed instructional devices must be viewed as more 
than new audiovisual aids. So potent and so well designed are the better 
programs that they need to be considered in a new curriculum design- 
construction-control perspective, Gilpin (1962) has taken the position that 


a truly systematic concept of instructional technology is possible; others 
have indicated a similar orientation. 


The special significance of programe 
lioned response mechanism, but rath 
performance designation to 
sharply identified in program 


d instruction is not in the condi- 
er in the necessity for criterion 
üssess learning progress. This concept, so 


ming necessities, will force a more rigorous 
and better defined approach to designation of teaching tasks in other 


forms and other modes of instruction than have existed, Perhaps this 
movement will bring about the demise of those blunt and inane research 
designs that measure traditional teaching against some new fad or pro- 
cedure whose terms are as Vague and ill-defined as those used in traditional 
or conventional teaching. The teaching profession is surfeited with pseudo- 
research reports comparing total systems of instruction as if the only 
variable involved is a distinction between two global concepts. 

In brief, there is scarce justification for optimism in the field of research 
related to curriculum and communication. Most project reports are self- 
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ania acknowledgments of a preconceived bias toward an innovation. 
: any projects, which, in fact, are not rescarch-oriented at all, constitute 
demonstrations—or, at best, feasibility studies—of an advocated position 


or procedure. 


Meierhenry (1962) cautioned that, although much research is currently 


in progress, the prime requisite of any researcher is the ability to ask the 
right questions. Unfortunately, the more important questions have been 
The most important research questions to be asked at 
the moment seem to-be related to (a) characteristics of individual learners 
and their approach to learning: (b) characteristics and limitations of the 
media, systems, and mediational combinations: (c) role of the personal 
teacher as a mediating influence; (d) interactional dynamics of the above 
e) development of curriculum communication theories, 


neglected too long. 


three areas; and ( 
models, and instrumentation. 

In communications theory and research related to formal curriculum 

a beyond the 


t investigators have proceeded scarcely 


purposes, it seems tha 
witchcraft stage. 


Additional Reference: Elam (1963). 
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CHAPTER VIII 


Curriculum Development: Dynamics of Change 


PAUL R. KLOHR and JACK R. FRYMIER 


A CHAPTER on the dynamics of change in a review of research on curricu- 
lum planning and development might appear, offhand, to be unwarranted 
duplication, since each of the other chapters is also clearly concerned with 
change. Yet a survey of the literature of the last three years suggests the 
need for such a category if for no other reason than to focus attention 
on a crucial aspect of the total curriculum development process which will 
require increasing investigation in its own right if curriculum research is 
to mature and to take its place with other behavioral sciences. 

In effect, this chapter is based on the assumption that certain dynamics 
of change in the curriculum development process can be identified and 
that research into these aspects of change will help both to describe what 
is happening and to predict what will h 
Review, this topic was subsumed under 
implementation of curriculum dey 


on administrative, supervisory, and organizational arrangements. 
Throughout the literature are persistent pleas for more adequate con- 
ceptual systems and improved theory. In the study of curriculum change 
itself, some notable beginnings were made during the period under survey. 
The state of the field is such that no tightly drawn categories can be used 
to organize the research under purview, However, the studies reviewed did 


suggest the following classification: search for models; diffusion of innova- 


tions; field demonstrations; redefinition of change; roles, role perception, 
and personality structure in change; and contributions from related fields. 


appen. In previous issues of the 
topics dealing primarily with the 
elopment, in which emphasis was placed 


Search for Models 


In the quest for more adequate models for th 


à e study of curriculum 
change, the volume of readings in the a 


pplied behavioral sciences edited 
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"Wn Ricard E outcome of planned change as projected by these 
sts rests basically on the relationship established between the giver 
i dane EX helis. How well the: process is understood by each and 

1 gre penness for examination and for possible reconstruction 
exists for both parties are, therefore, of central importance. Selection after 
selection in this volume, ranging from abstract considerations to research 
reports with detailed analyses of methodology, supports, in a strong, con- 
sistent way, the position taken by the editors. 

Kushner and others (1962) developed a propositional inventory of 
hypotheses accounting for sociocultural change. Four hundred logically 
ordered hypotheses drawn from ethnographic studies and from statements 
of theory were organized into 13 categories. Each category is made up of 
a series of propositions illustrated by hypotheses. For example, one category 
holds together a number of propositions that account for the acceptance or 
rejection of an innovation through congruence or compatibility, Another 
category groups propositions that suggest techniques and motivations used 
by change agents to introduce innovations. Although this inventory was not 
seen by its authors as a theory of sociocultural change, jt can serve a 
useful function as a general conceptual framework for examining curricu- 
lum change and for designing research to probe more deeply into this 
component of the total curriculum development process. 

Serving a similar purpose is the work of Barton (1961a), who used a 
general systems approach to classify a number of studies about the 
behavior of organizations, including schools. His classification was made 
to show some principles of measurement applied to complex organizations. 
Concerned with the logical nature of the measurement operations that 
might be used to study college environments, he saw an understanding 
of these as a necessary condition for the designing of more effective 
measurements. His classification involved six categories: inputs, outputs, 
environment, social structure, attitudes, and activities. Although Barton 
asserted that he did not intend his classification to be a theory, it too 
Suggests a general model for undertaking research into the nature of cur- 
riculum change within a school or school system. 

George Maccia (1962) used von Bertalanffy's general systems theory 
to develop a nonrepresentational model for the study of we The 
Beneral systems theory builds on the ideas of yes iiy an : on inter. 
actions among the parts of a system. With his model, w m makes central 
use of the concepts of input and output, Maccia generated an outline of 
an educational theory. Four subsystems were identified within the system: 
administering unit, facilitating unit, and inquiry unit. This 


*eaching unit, 
model and its related theory ing approach to the 


represent a promis 
complex problem of assessing change in education. Moreover, they throw 
light on the special task of measuring and evaluating curriculum change. 
A similar contribution was made in the work of Elizabeth Maccia (1962) . 
Who used graph theory to develop a model and to translate it into an 

n effect, graph theory was used to represent 


outline of educational theory: ! 
305 


Review or EDUCATIONAL Rese: Vol. XXXII, No. 3 


characterization of the relations of persons. in groups within a school 
usi Lois " 
uo (1901) outlined a theoretical model of organizational behavior 
which makes use of the organizational equilibrium concept. An adaptation 
and extension of the Getzels-Guba theory of social systems, it identifies a 
series of procedures for change in public schools along five dimensions: 
cultures, politico-economic environments, institutions. individuals. and 
informal groups. Eight precautions to be observed in the implementation 
of change in a school setting were formulated. 

In the quest for a more adequate sense of direction. in curriculum 
change, Frazier (1963) identified five uses of new insights in curriculum 
theory and practice. His analysis was based on the 1963 Association for 
Supervision and Curriculum Development yearbook, w 
reporting new insights from various fields w 
tions for curriculum. He asserted that new insights may help the educator 
(a) to ask new questions, (b) to re-examine assumplions, (c) to see new 
distinctions that need to be made, (d) to respond more completely to things 
already understood, and (e) to imagine new possibilities, His observation 
that new insights do not always reveal their most effective use unless they 
are studied suggests a crucial dimension in dealing with change in cur- 
riculum development operations. 

Sweitzer (1962) pointed to the value of the 
among two or more theories. Although he 
educational administration, the applic. 


theory and to the study of curriculum change is clear, whether one is 


seeking to identify the major questions to be investigated or trying to find 
a higher conceptual level than that suggested by one of the theories being 
compared. He demonstrated his point b 


rated y an analysis of the complementarity 
between the nomothetic-idiographic model of Getzels and Guba and the 
input-output framework of Stogdill. 


At this stage in the search for more adequate models with which to 
study the dynamics of curriculum change, one senses the need for a model 
of models. Sweitzer's proposal is a step in this direction. 


hich paired chapters 
ith others delineating implica- 


principle of complementarity 
was concerned with theories of 
ability of his view to curriculum 


Diffusion of Innovations 


Research on the diffusion of innovations. 


: > Which has increasing sig- 
nificance in such fields as rural sociology, has not been widely used yet in 
the study of curriculum change. The work of Mort and Furno (1960) stands 


almost alone in this regard. They Proposed a sequential simplex model to 
account for the various classes of influence on schools. The logic of this 
approach rests on measures from a financial continuum taken at three 
points: wealth, actual expenditure, and professional salaries, Although the 
procedure has a sound statistical base and provides a method for comparing 
the impact of certain measurable influences within a group of schools, it 
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fails, in larg — : 
ails, in large measure, to reveal the kinds of data required 10 direct 


curriculum change. 
" broader view of diffusion research was taken by Rogers (1962) in 
neki and sy nities of me than 500 studies ranging from new drugs 
g physicians to hand tools among primitive tribes. His paradigm of 
the adoption of an innovation by an individual in a social setting and his 
method of conceptual variable analysis provide a way of relating theory 
iS research which has applicability to the study of curriculum change. 
Rogers’ listing of 52. generalizations in order to summarize what is known 
about the diffusion of innovations suggests many specific studies yet to be 
made. Studies might be undertaken in a school situation where curriculum 
change is under way to test, for example, his proposition that personal 
influence from peers is most important al the evaluation stage in the 
adoption process and less important at other stages. With data from this 
kind of diffusion of innovation research, the Mort instrument might 


become more useful. 

Although not concerned so directly with the problems 
innovations, several investigators have thrown light on closely related 
aspects. Mackenzie (1962) and Huebner (1962) examined the political 
qualities of curriculum change. Mackenzie proposed a model to show the 
interrelations of three factors in change: social context, participants, and 
participant interaction. He placed these on a vertical axis and located 
three stages of outcomes in the curriculum development process on a 
horizontal axis: identification of focus, development of proposals, and 
implementation through teaching. He then stated six propositions about 
curriculum innovations to test the model and called for the development 
and testing of other models to the end that increased ability to explain and 


control the process of curriculum change might be fostered. Huebner urged 
of political science to help analyze the 


that educators use the concepts o Be) P 
field of curriculum, redefine their roles, and direct new inquiry. As illustra- 
lions, he examined action, elites, and ideologies as these concepts relate 
to educational operations. 

An analysis of the sociology 9 
(1961b) has implications for the stuc 


He iorshi ong st 

charted the relationships among * BE l 

influence reading research | developed a scheme that permits an exami- 
g res 


and 
nation of the social organization of such research, the methods used, the 
D A » 
channels by which it is inlerpreted and communicated, and the response of 
educators to it. This analysis in ends striking support to 


of diffusion of 


made by Barton 


ly of curriculum change in general. 
atus groups and institutions that 


f reading research 


a specific field | 

the more general thesis advanced by Mackenzie and Huebner and suggests 
g à E EU 

a framework for investigating the nature of diffusion in many other areas 


of educati 1 " rriculum innovation. 
cation, including Curr! 3 ; . 
In the field of education, in curriculum change, as well as in such other 


areas as leadership theory and research, there appears to be little concern 

With the problem of power and power relationships. The Mackenzie, 

Huebner, and Barton efforts are exceptions. Political scientist Janda 
, 
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(1960) called for a conception of leadership explicated in terms of end 
Gross (1962), a sociologist, described the lines of integration around which 
action is centralized and maintained in a social organizalion such as a 
school and delineated eight bases of power undergirding these lines. He 
concluded that an individual must have a clear understanding of the power 
available to him and to others in undertaking change. : 
The research and theory-building efforts that focus on the problems of 
diffusion of innovation in educasion, especially innovations in curriculum, 
make one need clear—the need for more adequate agencies for change. 
Numerous proposals have been advanced. The most viable, among those 
surveyed, is that made by Clark (1962), who proposed the establishment of 
new mechanisms for the development of educational research. He called for 
the establishment of a series of research institutes, which, in addition to 
providing support for basic and applied research in six categories including 
curriculum, would also provide for the needed field testing of research. 
He recommended also the establishment of state and regional experiment 
centers for demonstration and dissemination of research findings to practi- 


tioners. These proposals have major implications for changing curriculum. 


Field Demonstrations 


into a field demonstrations category. These efforts ha 
istics of descriptive research, 

An illustration is the study undertaken in N 
Education Department, With a grant from the Fund for the Advancement 
of Education of the Ford Foundation, Brickell (1961) identified barriers 
nge in selected schools. He visited 30 
nine hours each and spent seven days in 
eacher education institutions were visited. 


ew York for the State 


ent is a failure to distinguish among these 
s of the process of change in medicine, 


stimulate and to finance the d 
programs. To disseminate the pro 
mended the creation of regional development units, 

Two serious weaknesses, which t 
studies of change, stand out in this study. The first is its narrow focus. 
Brickell grossly oversimplifies the co 
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it with shifts i > arr six s s s 

ii ith > hift: ain the arrangement of SIX structural elements o. the inslilu- 

ion: teachers. subjects s. Lime: cs. Se there 
n: teachers, subjects, students, methods, times, and places. Second, there 


is : : , i 
an absence of any clear conceptual framework to provide the necessary 
bserved change. These weak- 


e study by closing off from 
hich the specific recom- 
applicability, 


guidelines for assessing the quality of the o 
nesses limit markedly the usefulness of th 
wider public consideration the basic data on w 
mendations for New York State rest. Its transfer value, or 
in other situations is thereby seriously reduced. 

Trump and Baynham (1961) reported, for the Commission on the 
Experimental Study of the Utilization of the Staff in the Secondary 
School, a study which focused on change. It described the projects de- 
veloped in 100 junior and senior high schools involved in the Commission's 
study of staff utilization over approximately a four-year period. The 
authors did not include statistical data collected by the schools, inasmuch 
e been made available elsewhere. However, they did 
d. among other things, the following: 
udents with improved learning 


as some of these hav 
generalize on such data and conclude 


Innovations in the schools provided st 
Opportunities. Teachers were more satisfied. Principals exerted more effec- 
interest in experimentation was 


tive instructional leadership. General 

created in school and community. They then addressed themselves to the 
question of how other schools s e for change. Among the 
necessary conditions they recognized an factor: Change in- 
Volves people. They also emphasize sound research and 
evaluation procedures. 


This study suffers from shortcomings simila 
report, Despite its insistence that individuals and groups are central in 
ad that adequate research and evalua- 


the process of curriculum change 2! : $ 
tion procedures must be used, great voids exist between the available data 
and the conclusions reported; that is, one is required to accept their con- 
clusions almost on faith. Herrick commented in the June 1960 Review 
that the extensive surveys of the Commission during the period 1957-60 

al aspects of staff re- 


ini iv hanic 
Were centered on the administrative and mecha A E Sede 
Organization rather than on “corollary matters © curriculum change. 


his view applies to the more recent work of the Commission as well. Yet, 
4 H 

because of the generally confused state of curriculum development research, 

o 


; ic i d because of the 
especi shi ith the dynamics of change, an t 
piken hap dr eT I : d Baynham reports, which 
ip of the br 


kell and the Trump an port 
gives them high public visibility. ndoubtedly will wield con- 


both studies u 

Siderable influence. 
Frazier (1962) rais 
ministrators as curriculum lea 


hould organiz 
was the hum 


d the need for 


r to those of the Brickell 


basic: questions about the role of “ad- 
‘a ten which he applied to Brickell’s work. 


they have relevance also for many aspects of the Trump rye e a 
or a number of other efforts to analyze the yo a. x 
Yelopment using a field demonstration approach. Frazier was concerned, 
o 


ices that involved the 
or e i ‘eluded classroom practices t 
xample, that Brickell exclu In fact, the crucial role of the class- 


teacher at work with his own pupils 
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room teacher in curriculum change seems to be denied. Frazier urged that 
the process of curriculum development be redefined to make room for new 
functions of both teacher and administrator and that purposes be kept 
sharply in review throughout the process. 


Redefinition of Change 


'Taba (1962) advanced a strategy for changing:the curriculum which 
takes into account the questions raised by Frazier. Moreover. pursued as 
an approach to curriculum experimentation, it could provide the data that 
seem to be missing in the Brickell and Trump and Baynham analyses of 
change. She called for a deliberate inversion of the common procedure 
of starting with a general design. Instead, she proposed that innovation 
start at the teaching level with the planning of specific teaching-learning 
units, The results of experimentation at this level then would provide ü 
basis for the creation of a general design during a later phase of the total 
process. This approach is calculated to infuse theory into the operations 
of the practitioner from the outset. Although Taba's proposal is specula- 
tive, she drew on a wealth of experience as a curriculum consultant and 
marshaled substantial support for her position from the behavioral 
sciences. Carefully planned experimentation with the inversion approach 
would add significant knowledge to the curriculum development field and 
would represent an important step forward in the redefinition of curriculum 
change in order to make it researchable. 

Within the descriptive category, the work of McN 
stands in sharp contrast to the Bricke 
use in assessing curriculum change or in 
analyzed organized efforts to improv 
from a relatively small suburban scl 


ally and others (1900) 
ll report in terms of its potential 
planning innovations, This volume 
e curriculum in seven cases ranging 
á : 100] system to a state-wide program. 
From this analysis, the authors formulated 11 guidelines for improving 
programs of instruction and proposed criteria for evaluating change in 
program. Both formulations make explicit the values involved in the 
curriculum development process. Thus, they furnish essential resources 
for curriculum theory and practice not provided in many descriptive 
studies that fail to recognize the role that values have in the total proces? 

Another example of descriptive research that has promise for con- 
tributing to the redefinition of curriculum change is the study Guggen- 
heim (1961) made of 16 school districts in New York State to identify 
patterns of staff utilization. Using 3 case study approach to appraise ihe 
communications systems in the schools, he followed a unit of information 
that was involved in a referral on a deviant pupil through the organiza- 
tional system and found out from each recipient of communicated infor- 
mation data how information was received. The Lazarsfeld-Robinson 
pseudotest approach to quantification was used. Guggenheim found that 
the full utilization of staff is highly related to the kind and quality of a 


310 


dt 


June 1963 j 
' 1963 CURRICULUM DEVELOPMENT: Dynamics OF CHANGE 


school's cha - Pee r 
"rs able rbi e a rar epee re phon decli 
cations to staff utilization and sehuul pa udy k; — Ht: of communi- 
urna culto pi idis eu HER e rh the research 

Teckman (1961) studied ra ‘influ oan de an —— s 
"E peat del he in ean o Kiate epartments of education 

S a ing associations on experimental projects in 39 
Using an extensive questionnaire and an 
bee atements, and published criteria, he found 
retina i " ae 100] leaders had received support for experimentation from 
rece fing that the standards and policies of such agencies did not 
et |. oa spa schools to conduct experimentation. Moreover, the majority 
I. s sapien in the study indicated that the present standards of state 
atts ents and regional associations are unable to keep pace with postu- 
me new practices in secondary education. He called for a redefinition of 
the role of these agencies as change agents. 

Alexander (1962a, b) identified some priorities in curriculum develop- 
ment including the establishment of a system of curriculum research coun- 
cils at the local, state, and regional or national levels. He sought in this 
Way to foster a more adequate screening. review, and dissemination of 
curriculum research. 

Wrenn (1962) speculated about the conditions that are likely to keep 
changes which were recommended by the American Personnel and Guid- 
ance Association's Commission on Guidance in American Schools from 
taking place and identified three cultural encapsulations that contribute to 


resistance to change and four preventive measures that might be taken to 
stance. Although aimed at the counselor, his recommenda- 

other educational personnel who have helping 
ar to those of the counselor. Seen in 


redefine the nature of change. 


selected secondary schools. 
analysis of standards, policy st 


ee this re: 
lons av . 
E. may be generalized to 
"a change-agent responsibilities simil 
ae Te 
is light, they represent an effort to 


Roles, Role Perception, and Personality 
Structure in Change 


-building kinds of operations 
amics of curriculum change, 
Some of them are theory- 


In moving from the more speculative theory 


Within the total spectrum of research on dyn 


Certaj à 
ertain descriptive studies have been noted. c hem L 
oriented, while others function as field demonstrations with little, if any, 


relation to a larger conceptual framework. However, the largest number 
of intvediiontiona made during the period under survey falls within yet 
another catewoce These are the specific, often isolated, studies that probe 
i Single aspect of the curriculum change complex. The most prevalent 
ocus for this research has been on roles. role perceptions, and personality 
Structure, 

practices held by 804 ele- 
identified through 
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a questionnaire survey and reported in a monograph by the National Educa- 
tion Association, Project on Instruction (1962). To identify some of the 
forces influencing change, principals rated a list of 14 possible sources of 
influence. Of these, the respondents indicated that local school officials had 
exerted the greatest influence. Faculty influence was rated second, and 
state school oficials were third, with research studies holding fourth place. 
Various national studies in academic subjects were a close fifth on the 
composite list, In general, the replies indicated that national studies have 
increased significantly in their influence from 1956.to 1961 but that little 
additional increase is expected between 1961 and 1966. 

Anderson and Anderson's (1962) study of more than 10,000 children 
and teachers from eight countries with projective, incomplete stories indi- 
cated clearly that authoritarian cultures do tend to induce defensive be- 
haviors among children and teachers more frequently than do democratic 
cultures. One can infer from these studies that the extent and nature of 
curriculum change may be directly related to the kinds of personality 
characteristics evident among the teachers involved. Future studies should 
be initiated to explore this implication. 

Flanders’ (1960) intensive investigation of teacher influence, pupil atti- 
tudes, and achievement is an especially important study in that it was care- 
fully framed within explicit theory and was aimed at developing instru- 
ments and procedures designed to help teachers change their classroom 
behaviors, Moreover, it included, in the research design, a liaison com- 
mittee of parents, teachers, principals, and supervisors to oversee the entire 
research effort. Interaction analysis, a precise technique for studying verbal 
activities in a classroom situation, includes working with teachers so that 
professed goals can be compared to attained goals. The fact that Flanders 
translated the results of this research into a series of filmstrips and record- 
a ig yan rimis in in-service education programs is also note- 

y. l'ew researc ners attempt to bridge the gap between their activities 
and classroom practice. These materials are especially significant in that 
they provide a way to induce curriculum change by involving teachers di- 
rectly in a process which has been substantiated carefully by research. 
Few in-service programs have such a base. : 

In the area of interpersonal perception, Rosen, I 


(1960) used the individual interview to study 64 preadolescent, under- 
privileged boys in a four-week camp situation, They found that, in a peer 
group, the desire for change in oneself or in others is related to the relative 
resourcefulness of individuals. While this finding clearly cannot be gen- 
eralized to groups of teachers and administrators involved in curriculum 
development, it does suggest possibilities for a number of other inquiries 
into the whole matter of how the concept of resourcefulness functions in 
the way individuals and groups see themselves in the process of change. 
Schutz (1961) grouped 70 educational leaders, involved in a series 
of 10 meetings, according to their reactions on a questionnaire designed 
to measure an individual's expressed and wanted behavior in the inter- 
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personal arcas of inclusion, control, and affection. After meeting six times, 
these groups composed of persons with individual interpersonal needs as 
determined by the questionnaire were able to identify accurately their 
ssibilities. This research suggests 
and Lippitt, that certain 
viduals grouped 


own group description from among five po 
strongly, as does that of Rosen, Levinger, 
group tasks might be undertaken more effectively by indi 
in terms of a measure of their interpersonal needs. 
. In another research undertaking, Schutz, and others (1961) applied the 
VIRO theory to a study of the interpersonal behavior of school administra- 
lors, Eighty scales were developed in which the Guttman scaling technique 
was used. The findings throw light on the nature of individuals who com- 
monly fulfill the change-agent role in the curriculum development process. 
For example, teachers were found to respect the principal similar to them- 
selves with regard to how he handles difficult situations. Also identified 
in the study were two groups of people whose attitudes toward educa- 
tion seemed closely related to the way in which an individual perceives 
others. One group displayed positive attitudes toward others and viewed 
the functioning of the school in a more open manner. The other group 
displayed a more conventional orientation and viewed the total educational 
process within a less flexible framework. 

Seeman (1960) used change as one of four aspects of leadership style in 
the total cultural approach he took to the situational study of 77 school 
executives in 26 Ohio communities. His findings have implications for 
identification and training of leaders to guide curriculum development. 
For example, he discovered that there is a clear carry-over of status ratings 
and orientations from one situation to another. This finding suggests a 
number of related questions to be investigated, if one assumes that cur- 
riculum change involves the community as well as the status leaders in 


education. 
Comparing teachers" and à 


role, Fishburn (1962) noled a fundamental dis : J 
groups. Whereas teachers saw their most important functions as mediator 


of the culture and director of learning, administrators saw liaison between 
school and community as the teacher's chief role. Hencley's Dau aner 
of board member and superintendents’ perceptions of oa = an eac 1 
others? roles also illustrates that these groups have con pei ames Spar 
about their areas of responsibility and authority. aoe aoar i Tead : hi 
Harris (1962) reported that graduate students perce i " ' a d es 
roles corresponded directly to those of practicing pee un ies 
Sultants, it may be that programs of tramin - € poem dei 
are helping perpetuate certain basic misconceptions among g 


dministrators’ perceptions of the teacher’s 
agreement between the two 


of the profession. 

Drawing on his experienc 
a stasis-process continuum 
relationships between these ideas 
Conditions that seem to be present wh 


psychotherapy: Rogers (1961) outlined 

ange. Exploring the direct 
he then spelled out the 
al moves from stasis 
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toward process in the act of becoming. Assuming that motivation for 
change and learning springs from life itself, Rogers describes the conditions 
that can be provided by the therapist and that scem essential for desirable 
change. These include congruence, unconditional positive regard, empathic 
understanding, and awareness on the part of the client that these conditions 
exist in the therapist. Although the concepts have not been substantiated 
by extensive research in either psychotherapy or education, the framework 
of the theory should become clearer and have more relevance as hypotheses 
regarding change emerge and are tested within its confines. 

Bills (1963) cited several studies based directly on Rogers’ concepts. 
The more a teacher was open to his experience, the more his students re- 
garded him as congruent, empathic, positive, and unconditional. The 
locus of decision making for the more-process teachers was centered in the 
class, including the teacher. On the other hand, the locus of decision 
making for the less-process teachers was scen as centered cither in the 
teacher (autocratic) or in the students (laissez-faire). In another study 
cited, Bills reported that teachers and administr: 


experience changed significantly in their attitude 
others as 


ators who were open to 
s towards themselves and 
a result of a summer workshop experience. Those persons 
who were less open to experience, however. were unaffected by the work- 
shop. These studies suggest that those who seek to change the curriculum 
by involving teachers must contrive in-service education efforts that are 
tailored to meet the degree of openness of the various teachers involved. 
No single approach to the problem seems adequate. 

These studies, taken as a sample of w 
dynamics of role, role perceptions 
tional change, suggest the import 
riculum development progr. 


ide-ranging investigations into the 
. and personality structure in educa- 
ance of continued investigation in cur- 
ams of the general sensitivity training approach 
which has characterized efforts in the field of business research, Helfant’s 
(1961) study of the effects of three human relations workshops held in 
the Manhasset Public Schools in New York underscores this necd. He 
found that group therapy was effective in modifying the attitudes of ad- 
ministrative personnel and in maintaining a preventive mental hygiene 
climate. 

Shumsky and Mukerji (1962) studied the role of the consult 


ing about changed behavior in action research projects and concluded 
that (a) the way the consultant relates himself to the teacher-researcher 
during the implementation phase of the undertaking is highly significant 


and (b) the deepening of the teacher's self-understanding is a key factor 
in changed behavior. 


ant in bring- 


. 

In a study of the personality characteristics of teachers who were rc- 
sistant to change, Myers and Torrance (1961) 
characteristics: authoritarianism, defensiveness, 
sensitivity to pupils’ needs. low energy level, 
lion-giving functions. intellectu 


identified the following 
domination by time, in- 
preoccupation with informa- 


al inertness, disinterest in promoting initia- 
tive in pupils. preoccupation with discipline, 
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endis trig ladies mei Tis folus te 
i ore S i spores o 4 teachers in 14 states to the request 
that they report how they applied five principles in their teaching. The 
principles, which suggested ways of rewarding creative thinking in the 
classroom, were (a) treat questions with respect: (b) treat uin 
ideas with respect; (c) show your pupils that their ideas have value; (d) 
occasionally have pupils do something for practice without the threat of 
evaluation: and (e) tie in evaluation wit causes and consequences. 

Cox (1961) reviewed four studies made at the University of Georgia, 
the findings of which strongly supported the importance of viewing the 
role from various perspectives. A similar out- 
ho chaired an Association for Super- 
tee's survey of the 


instructional supervisor's 
come was reported by Guss (1961), w 
vision and Curriculum Development research commit 


perceptions of the supervisor which various individuals held. 


Contributions from Related Fields 


Recent summaries of research and theory in the field of business manage- 
testing in any comprehensive effort to 


ment suggest approaches that merit i 
study the nature of curriculum change. Likert (1961) reviewed extensive 
research carried out since 1917 at the Institute for Social Research of the 
University of Michigan and developed, among other things. (a) principles 
ganization and (b) 


that describe the interaction-influence system in an org l 
a comparative analysis of organizational and performance characteristics 


of different management systems, ranging from authoritative to participa- 


live, 
Tannenbaum, Weschler, and Massarik (1961) reported work repre- 
sentative of a 10-year research and theory building effort of the Human 
School of Business Administra- 


Relations Research Group of the Graduate f Bi à 

tion, University of California, Los Angeles. A basic idea underlying the 

work of this croup was that leadership, influence, and organizational 
om tee 


effectiveness all bear directly on improved human capacities to diagnose 
interpersonal situations and to act effectively on the basis of this diagnosis. 
In short, their view of change: which builds on the pioneer concepts of 
Lewin, Rogers, Moreno, and Bion, is reflected in the more recent efforts 
B gers, , T wr 

of such men as Benne, Bradford. Lippitt, and Thelen, — 
Vroom (1960), working with data obtained from 1,676 employees of 
; : à 
a large industrial organization, found that persons tend to attribute their 
own attitudes, opinions, and te persons. groups, and organizations 


toward which they have a positive attitude and to deny them in persons, 
Broups, and organizations toward which they have a negative attitude. 

In a comprehensive effort at theory building, Presthus (1962) analyzed 
research from several social science areas as it applied to the behavior 
of individuals within bureaucratic organizations. Having underscored the 
dynamics that create conflict between persons who strive for professional 
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competence and those who seek stalus and power through administrative 
roles, he then delineated the aspects of large organizations that affect the 
functional relationships of members within the structure, Individuals who 
identify strongly with the organization, who aspire to administrative posts. 
and whose major satisfactions come from acceptance of and involvement 
with the institutional values and goals are characterized as upward mobiles. 
Those whose patterns of accommodation include simultaneous rejection 
of the organization’s purposes and acceptance of the security available 
for those who go along are identified as the indifferents. Neither of these 
major groups promotes or encourages critical examination of an or- 
ganization’s structure, purposes, or values. Ambivalents, however, who 
represent a dissident minority group whose behaviors are essentially intel- 
lectual and introspective, provide concern for innovation and change 
through their probing criticism and resistance to authority. Seldom func- 
lioning even in relative harmony with the organization, the ambivalents 
ultimately alienate themselves from the organizational elite. This model 
should generate fruitful hypotheses for researchers interested in the impact 


of institutional structure upon educational goals and the nature of change 
within the institutional setting. 


Need for Further Research 


By way of conclusion, perhaps the best way to sense the state of the 
field—beyond whatever insight one gains from the preceding review of the 
literature dealing with the dynamics of curriculum change—is to focus 
the several major problems that clearly require further research. Three 
such problems are apparent. 

One of the basic questions that has not been studied directly is whether 
involvement in the research process itself is j 
curriculum change. Barnes and others (1960 
book of seven classroom teachers! efforts to improve instruction through 
research. It implied that those problems that stem directly from classroom 
teachers’ interests and difficulties offer unique possibilities for curriculum 
change by involving those most immediately concerned in practical studies. 
This is, essentially, the action-research approach. 

Inherent in this approach is the basic idea that curriculum research 
from the bottom up is the most productive 
bring about curriculum change. A few school 


an effective way to bring about 
) published a descriptive hand- 


and most effective way t0 
systems, however, maintain 
highly organized research divisions »yhose purpose is to initiate curriculum 
research from the top down. Which approach is more effective in en- 
couraging significant curriculum change should be examined: Speculative 
treatments of this problem are necessary but not sufficient. 

Two other problems are related to the question, Who should initiate 
curriculum research? The first concerns research design. Many persons in 
university and governmental research centers are urging educational re- 
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searchers to devise more elaborate and complex. research designs. In an 
obvious effort to cope with the many variables inherent in the total pratese 
of educational change, researchers are encouraged to conceptualize their 
projects in such a way that as many of the important variables as possible 
are brought within the framework of their design in some manipulatable 
or controllable way. 

These efforts natuzally result in research reports that are complex 
and difficult to understand. Some reporis regularly employ statistical 
that the typical classroom teacher or school 
administrator has not been trained to understand. In view of the resulting 


maze of detail and unfamiliar terminology, there may actually be a widen- 
esearch data and those most interested 


assroom situations. 


concepts and procedtres 


ing gap between those producing ri 
in utilizing the results of these studies in cl 
l'wo hypotheses seem worth exploring in this regard. First, Is there a 


need in the field of education for some type of dissemination service or 
agency that would function in a manner comparable to the county agent in 
agriculture to help teachers and administrators relate the best research 
results available to the local school situation? Could such an agency both 
strengthen and broaden the communication bridge between researchers and 
practitioners to the extent that curriculum would be affected in any ap- 
preciable way? Brickell (1961) and Clark (1962) both made proposals 


along this line. 

Second, Should those persons 
establish policies and structures, [o : 
regularly engage teachers, parents, and administrators in effort 
across organizational and grade level lines 1n order to improve —À 
through research at the local level? McNally and others (1960) and 

iable approaches to this 


Alexander (1962b) outlined what appe 
cale researc 


ositions work directly to 
within their own school districts, which 
fforts that cut 


in leadership p 


ar to be v 


problem, but further large-s h is required. . 
s Hd arch at all which bears directly on 


Although there is apparently no researc! 1 i : 
one final problem, the growing surge of interest in both professionalism 
and curriculum concurrently would seem to suggest that educators might 
Well explore the impact of the structure of the profession itself on cur- 
riculum development and curriculum change. Does the nature of no 
fessional relationships that currently prevail among principals, teac ners, 
and superintendents inhibit the process of curriculum den CR in any 
Important way? In other words, Is there any evidence that the general 
reluctance of teachers tO initiate or participate in extensive curriculum 
change is a direct function of the kinds of relationships they have with their 
administrative superiors? Can those who work directly with curriculum 
bring about significant improvement in the instructional program in a 
Profession dominated and controlled by persons whose interests in the 


curriculum ipheral? 

are periphera!* z " . 
Though nò treet evidence is available, it would seem that those groups 
that have strengthened their professional status through stronger organiza- 
= limitations upon their membership through 


lion and that impose definite 
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internal controls have simultaneously increased their effectiveness. There 
would seem to be more than a casual relationship between. the rise of 
the physicians in professional stature, for instance, and their ability to 
achieve the basic purpose of maintaining ever higher levels of personal 
health among the public at large. In practice, this increased. effectiveness 
seems to have come about by the physicians’ efforts to contrive ways of 
helping themselves to be both aware of and affected by the most recent 
medical research. Significant progress in curriculum development is not 
possible until there is a general strengthening of the teaching profession 
through shifts in the power structure. Clearly, research into the dynamics 


of curriculum change must cope with this kind of problem, however 
complex it is. 
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CHAPTER IX 


Curriculum Rescarch: Problems, Techniques, 
and Prospects 


JAMES B. MACDONALD and JAMES D. RATHS 


Tus CHAPTER is concerned with research on curriculum research. In 
other words, its purpose is to locate studies that have focused on models, 
procedures, designs, and techniques in curriculum research, which enlarge 
understanding of the task and provide guidelines for curriculum research 
workers. 

Although reports of specific studies designed to attack this problem area 
are sparse indeed, discussion of the problem is in evidence. Under the 
advice of the chairman of this issue, the authors have attempted to put 
together a commentary on research in curriculum research which, it is 
hoped, will stimulate future studies in this arca. 

What is research on curriculum research? If one may take the lead from 
Gage (1963) concerning research on teaching, four concept definitions 
seem appropriate: curriculum, research, curriculum research, and rescarch 
on curriculum research. 

Curriculum is defined as status of or change in pupil behavior which 
is pertinent to the goals of the school. Research is defined, after Gage 
(1963), as activity aimed at increasing one's power to understand, predict, 
and control events of a given kind. It follows that curriculum research 
is research in which one or more of the variables represent the status © 
or change in pupil behavior which js pertinent to the goals of the school. 
Research on curriculum research, then, becomes an activity aimed at in- 
creasing the power of research methods designed for understanding, pre 
dicting, and controlling curriculum phenomena, 


Problem of Relational Contexts 


Research on curriculum research can profitably look at the problem of 
relational contexts. There are at least three contexts which come to mind. 
First is the context of theory. As Coladarci (1960), Gage (1963), Getzels 
and Guba (1957), Herrick (1961), and Willower (1962) have justly 
noted, research in curriculum must be guided by use of models, theories: 
or paradigms if hypotheses that can be related. meaningfully to a body 
of knowledge are to be generated. y 

Much curriculum research has been unrelated or unrelatable to theorel!- 
cal frameworks. The resulting dilemma has been both a lack of coherent 
direction in curriculum research and a failure to provide adequate frames 
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of reference agi aoi r 
sits Le deed —— progress in the field or for correlating research 
s. Lacking the conceptual tools of older sciences, curriculum research 
nust combine the process of hypothesis construction with theory de- 
velopment. 
F The significance of hypotheses without theories has been criticized. 
ei example, Carroll (1961) suggested that educational psychologists have 
ailed to provide sufficient knowledge about some central problems in 
school learning. He stressed that research should result in statements about 
eel effects teaching procedures and other actions would have upon the 
inds of student behavior one does or does not desire as outcomes of 


education. Fisher and Roth (1961) suggested multiple hypotheses, for 
enomena in their complexity. 


any study, as a way of examining relevant ph 
ould, by necessity, 


Multiple hypotheses and their stated interrelationships w 
involve the use of models, theories, or paradigms. 
Joncich (1962) attempted to describe the history of the development of 
educational rescarch efforts and to indicate some of the present problems. 
Central to her criticism was a concern that curriculum research had been, 
and was still, guided by the progressives rather than by the neutral 
and objective workers in the field. Apparently. her statement refers to 
the substitution of a philosophy for a theory, as a generator of hypotheses. 
Her caution is well put. However. the authors are much more concerned 
about biased research, which is done to achieve some practical goal in 
school systems, than they are about the danger of progressivism in re- 
search, It would be useful, however, to have empirical evidence of the 
values which researchers hold in relation to their research activities. 

Tt would appear that curriculum research efforts could be improved con- 
siderably through the use of adequate frames of reference within which 
hypotheses are generated. The testing of these hypotheses would have the 
dual effect of providing data both about theory and about the original con- 


Cern or felt need. In addition, it is only as this criterion is met that null 
n any meaningful way to ones knowledge. 


dom are able to contribute i ied. effort t dd 
z P 3 i certed effort toward th 
Ver a reasonable period of time, with some con e the 

one’s understanding 


Systematic use of known conceptual frameworks, n e 
about the relational context of a theory would be considerably improved. 


5 j 1 theory in research would go 
hus, research on the relational context of ) g 


on side by si ‘ culum research per se. 
à side with curricu r i 
2 second relational context is that of previous knowledge and research. 
this is inseparable from theory, an area of 


lth ; 
ough ; ways 
gh, in many ways: $ n B 
unknown dect could be explored if curriculum research were à ie 
1n terms of adequate communication cf endeavors among curriculum re- 
Searchers, It "x t be well to study the effects of adequate communication 

ex 1g relevant to this problem may be 


9n curri ch position 
riculum research. One suc? P failure of many researchers and 


terline? is ae bout the 
erüneg's (1959) suppositions a Mond bi interesting to know the number 


editor. 3 
S to publis ]l results. s ; 
poor idea. i see been retested because of the failure to communicate 
Previo 
us null results. 
results - 
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'The third area of relational contexts in curriculum research to be con- 
sidered is that of the empirical setting of the activity. A slight modification 
of Gage's (1963) continuum of research can be used to illustrate this 


context: 
hu ——————— —ÓÓ— URNA 


Basic Basic Educa- Classroom Field Installation 

science: science: tionally experimen- testing of program 

indifferent relevant oriented tation 

content content problems 

Pure Simulated Applied 
RESEARCH 


ee E ——Áe———— 


In terms of the writers’ definition, curriculum research would probably 
not be involved in the first two categories of the pure end of the continuum. 
As the continuum moves toward the applied pole, there is a marked 
change in the relational or situational context within which the research 
is done. Thus, one may move from studies of problems of learning in 
laboratories to the installation of a new mathematics program based e 
newly discovered principles of learning. Not at all clear, at this point, 18 
whether a given aspect of an educational concern needs to progress 
through all stages, many stages, or a few stages. Of inestimable value would 
be knowledge of the effects of situational contexts on the findings of 
curriculum research. 

A further related problem of changing contexts is the question of what 
should be done. Whether one likes it or not, curriculum research tends to 
go beyond the problem of what is and into the problem of what should 
be done. Curriculum decisions involve more than data; they involve 
values and other reality (e.g., practical problems). Thus, the application 
of a statistical test of significance to a problem in curriculum research 
does not end the problem-solving cycle. In other words, although the 
curriculum researcher organizes his research design on a what is basis, be 
cannot forget that his activity is applied research. He must remember that 
the cycle of problem solving will not be complete with a significant finding. 
There would appear to be an extremely valuable area of concern for 
curriculum research on the problem of how practitioners can move from 
results to applications in the most efficient and most useful manner. In other 
words, there should be some research concerned with the broad logic © 


applied research, which attempts to fit what is studies into what shoul 
contexts. 


Problems of Design 


As mentioned previously, most of the concern about designs used in 
cürriculum research has been manifested on the verbal level rather than 
on the level of inquiry and research. For instance, Strang (196 
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discussed weaknesses in the matched-groups approach as well as in corre- 
lational or factor-analytic studies. She suggested a single-group experiment 
in which all factors in a given set are described. Whether this is a feasible 
alternative or not, Strang did focus attention on the problem of design in 


curriculum research. 

In this context, there have been many 
the null hypothesis model for decision making, as evidenced in the papers 
by Green and Tukey (1960), Kaiser (1960), and Nunnally (1960). Under- 
lying these questions is the conclusion that, since the null hypothesis is 
almost never true. an experiment to reject it is therefore unreasonable. 
Such arguments continue by asserting that, if an experiment fails to reject 
the null hypothesis, one conclusion, which is almost always possible, is 
that the sample was too small. Although there is a great deal of merit in 
the logical evidence attacking this model, research studies testing its 
efficiency as a decision-making model are lacking. Perhaps Sidman (1960) 
offers some direction to curriculum research workers when he espouses 
research focusing on individual behavior patterns. He asserts that group 
designs may fail to identify similar patterns among groups that have differ- 
ent means. Probably Sidman’s most telling argument is his questioning of 
the process of randomization of uncontrolled variables. 

Another pertinent area that involves necessary decisions by researchers 
is the establishment of levels of significance. More and more researchers 


are taking a hard look at the sacred cows of decision making—the at and 
-05 levels. Winer (1962), for one, suggested that, at times, even the .20 or 
-30 levels of significance might be appropriate. This area of — could 
be one of legitimate interest to researchers. An attempt to esta blish y 
guidelines for setting up appropriate levels of significance in lap o 
sample sizes, powers of statistical tests, and other ege certainly seems 
advisable. This problem is also related to the context e oe m t 
mentioned earlier. Perhaps decisions about significance of Am linge shou 
be left to the users of a study rather than to researchers or editors. 
In an article directed at the central concern of this chapter, Lambert. 
r an experimental 


Miller, a Tiley 2) cited three requirements for 
9 ond EIUS DS ndn (a) an adequate sampling of a research 


duction of treatment conditions that could be expected 
among treatment groups. and (c) use of sufficient 
h of the experimental groups. This last require- 
t of power of statistical tests— 
ize of the experimental groups. 
mbert, is dealt with by 


serious questions raised about 


design in curriculun 
population, (b) intro 
to maximize differences 
numbers of subjects in eac 
ment is obviously related to the) pounce 
à quantity that varies directly with the s 


Another i on which, according to La 
ertinent question . e m" : . 
all pea ae dan either implicitly or explicitly is how much of 


an observed difference is required if it is to be judged significant. Of 
course, this question also directs the researcher's attention on the power 
> s 


of the statistical tests in his design. 
ical tests 1n S " à g 
The assertions with reference to curriculum research designs that have 


been stated in this section are. in the main, a result of conjecture. It is 
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important that these assertions be tested in empirical settings that afford 
appropriate experimental designs. Of considerable aid to research workers 
in curriculum is Campbell and Stanley's (1963) chapter in the Handbook 
oj Research on Teaching, in which 16 experimental designs are treated 
within the framework of 12 common threats to statistical inference. 


Other Issues in Curriculum Research 


Strang (1962) suggested that education is not yet a science and that pre- 
mature attempts of educators to be scientific have led perhaps to mis- 
leading conclusions. Her argument seems to be focused on the rather weak 
or untested evidence-gathering techniques which researchers use. 

In a vein similar to Strang's, Carroll (1961) identified the regressive 
maneuvers which educational researchers tend to fall back on—the usc of 
questionnaires, supervisors' ratings, self-inventories, and the methodologies 
of statistics. He asserted that a direct attack is needed on the crucial varia- 
bles of learning rates and attitudes, motivation, quality of instruction, 
application of the laws of learning, and teacher competence. Educators 
readily acknowledge that many factors affect behavior and that these many 
factors act in an interdependent fashion. It is interesting to note that educa- 
tors rarely have definitive evidence to suggest which variables of concern 
are really independent and which are dependent. Certainly, specific varia- 
bles may play either role in various research designs in terms of the labels 
assigned by researchers. However, the act of calling variables independent 
certainly does not assure that they are truly independent. In spite of this 
apparent insight into the complex relationships of variables. curriculum 
experiments generally concern themselves, as Fisher and Roth (1961) have 
a ee 

: pear to be caught in a dilemma. Either they 
must wait for some theory or method that will allow them to deal with all 
crucial variables in curriculum research, or they may continue to work 
with a single factor or with several factors in anticipation of the ultimate 
vs ain ue — questions about uncontrolled or overlooked vatiables 

eing raised at the close of the experi i anc 
Sidman's (1960) views would Je puisse dps ! s rer 

$ ggest that, in curriculum 

search areas, experimental designs be waived temporarily. Curriculum 
researchers could resort to careful observations and descriptions until such 
time as the datum of curriculum research is identified and generally 2€ 
cepted by researchers. Then, meaningful hypotheses could be generated an 
tested. > 

The position regarding curriculum research, as defined in this chapter: 
may find some support in Gage's (1963) observation that use of measures 
of pupil gain or pupil growth as the ultimate criterion of teacher effective- 
ness is not feasible, at this time, in light of both conceptual and practica 
difficulties in use of the pupil gain criteria. Therefore, it may be reasonable 
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to hold i se d : " 
i dl onis pagato ec E Ao vam d 
understand and make predicti le q arial sa oo a d fo 
this is done, it would : p i g ^r validis = these atens: Onee 
Pi du seem that more fruitful research in curriculum could 

Gage’s position is quite relevant, in 
curriculum field. What is the ultimate criterion for judging the effectiveness 
of new teaching methods, content approaches, or administrative arrange- 
ments? The ultimate and ideal criterion would seem to be, as Gage suggests 
it is for teacher effectiveness, pupil gain or pupil growth. However, on 
closer examination, this issue seems to be highly complex. What of the 
findings of the television studies that reported essentially null results con- 
cerning achievement differences between groups of students in television 
? Does this evidence suggest that one abandon 
on, or, instead, that one should have all 
actors should be assessed? Since 
it needs to be explored 


another sense, to researchers in the 


and in nontelevision classes 
television as an effective innovati 
instruction via television? What other f 
this criterion issue is of paramount importance, 
by curriculum research workers. Unlike other issues raised in this chapter, 
empirical sources may not be adequate for validation of an assertion in 
this area. It may well be that a psychological orientation involving a 
systematic consideration of values will best serve as a basis for answering 


this question. 


Conclusion 


Although there is almost no research on curriculum research, the limited 
evidence presented from a review of the literature would seem to indicate 
that there certainly are a multitude of problem areas that could profitably 


be examined. 

The area of relational co 
communicating knowledge 
broad contexts of an applie 
Problems of design cannot pa 


different designs have been attempted : 
mental designs in contrast to careful observation seems appropriate for 
o 


study. The need for common concepts, valid sources of evidence, and 
relationships of relevant variables, including the criterion variables, seems 
to be open to rigorous examination within any design problem. 
Basic to all ‘curriculum research is the question of whether research 
r fields are appropriate for curriculum research. 
are appropriate. and then again maybe there 
is a need for new curriculum research models po se. Ae Mg wayi in xus 
these questions can be answered iS throug ai a Su um 
1 researchers should give this 


research. It seems self-evident that curriculum 
activity hi : 
Clvity high priority- 
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CHAPTER X 


Curriculum Planning and Development: 


A Review of Reviews 


WILLIAM M. ALEXANDER 


Ix THE first “Review of Reviews" to appear in the February 1962 REVIEW, 
Harris stated that such a chapter should “present a judgment of the 
extent to which the issue provides more than an abstracting service—valua- 
ble as that service may be? (p. 103). The present reviewer of reviews sees 
this judgment, for this particular issue, as answering five questions: What 
studies reviewed seem to be critically significant? What new and promis- 
ing areas of curriculum research appear to have been opened? What con- 
tributions have been reported to a theory of curriculum planning and 
development? What new and promising procedures of curriculum research 
have been identified? What is ahead in research for curriculum planning 
and development? 

In answer to the last question, in capsule form, the reviewer is definitely 
optimistic about the future of curriculum research. This area, notable 
mostly for neglect since the 1930’s, is coming to life again. True, little 
within itself that is worthy of note has been concluded during the past three 
years. Indeed, this reviewer can say again what he did in introducing the 
June 1954 curriculum issue of the Review: “the chief value of the present 
issue is to point up the need for a more significant effort on the part of 
educators to undergird their curriculum theories and practices with specific 
research studies” (p. 193). Now, nine years later, a statement can be made 
that was not made in the June 1954 issue: namely, significant beginnings 
have been made that are prophetic of meatier curriculum issues of the 
REVIEW in the future. By June 1969, if all goes well with cooperative 
research efforts now initiated or being planned, one would be privileged 
indeed to review the reviews of curriculum research. If the 15 years ending 
in June 1969 witness the rebirth and flowering of curriculum research, the 
profession will be able to take great pride in the accomplishments of these 
years. 


Studies of Critical Significance 
Probably the most promising research study under way in terms of its 


implications for curriculum planning and development is one not review 
in the current issue. Project Talent, which was initially reported by Flan- 
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agan a others (1962), is in its infancy, of course, as far as publications 
aren — Perhaps because of this and, more likely, because of its 
an deep 1, the project was not reviewed in any previous chapter of 
m issue, This reviewer, however, is thoroughly in agreement with Mouly 
63 ), who includes in his annotations of 21 selected significant research 
studies only one study initiated since World War II, namely, Project 
Talent. Although neither he nor the present reviewer would exclude all 
other studies of the past 15 years, certainly no project of recent or current 
times offers more hope for significant research than the great accumulation 
of data available through Project Talent. 
It should be noted, too, that many other research studies supported by 
the Cooperative Research Program of the U.S. Office of Education, which 
are just under way, are not reported in the educational press yet. One can 
hope that many of these studies will offer significant data for curriculum 


planning and development. 

_ Authors of the various chapters identifie 
ings or hopes of findings of real significance. 
of Chapter V observed that there is much research dealing with the work 
of the School Mathematics Study Group (SMSG), although the research 
Currently concluded has barely scratched the surface. On the other hand, 
it is gratifying to see the scope of the research already conducted and to 
note the report that SMSG and the Minnesota National Laboratory have 
planned a new and comprehensive five-year study. If these new studies can 
eliminate the deficiencies of those already concluded, then we may be 
hopeful about the prospects of significant research related to nationally 


oriented curriculum studies. . . . 
. The authors of Chapter III reviewed many studies that have interesting 
implications for curriculum planning. Unfortunately, as they observed, 
Some of these studies simply support or replicate older research. On the 
other hand, this reviewer finds some optimism about the research reported, 

lass. Such studies as those 


Particularly with reference to socioeconomic € j 
by Coleman (1961) and Havighurst and others (1962) have great potential 


for the modification of curriculum planning and development. As a matter 


9f fact, one wonders whether research in such areas as the relationship of 
d the relationship of the peer culture 


Socioeconomic class and education an Stes 
to social and educational aspirations might not need replication frequently 
in differing communities—perhaps periodically in every community in 
o a8 * H 
order to provide basic data for designing educational programs. 
f a chapter on teaching in 


Although some might question the inclusion of a } 
the authors of this chapter in the present 


a curriculum issue of the REVIEW: ; 
lieved by the reviewer to have extreme 


issue identified some research be : 
Significance for the planning and development of classroom experiences, 
a major element of the curriculum. If the quality of pupil-teacher inter- 


action is a critical factor in the curriculum of the individual, then what 

can be determined about the teacher's role in this interaction is important 

in curriculum planning and development. Hence, such publications re- 
v 
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For example, the authors 


Review OF EDUCATIONAL RESEARCH Vol. XXXIII, No. 3 


viewed in Chapter VI as those by Barr (1961). Flanders (1961), Gage 
(1963), Hughes (1962), Ryans (1960), and Smith (1961) affirm. the 
optimism with which one views the present status of curriculum research. 

Perhaps, however, the greatest basis for optimism concerning the field 
is to be found in the studies reviewed in Chapter VIII. Interestingly enough, 
the authors of Chapter VIII seem to view the development of models with 
more satisfaction than do the authors of Chapter I. Could it be that more 
has been learned about change in the process of education than about the 
nature of the process itself? References to studies cited in Chapter VIII 
are made at further points in this chapter. 


New and Promising Areas of Curriculum Research 


Although not new, perhaps the most significant development in cur- 
riculum planning in recent years is the re-examination of the disciplines 
as sources of curriculum content. Although the author of Chapter ]I was 
unable to identify significant research in this area, he did review promising 
efforts to examine the structure of the disciplines and possible approaches 
to their utilization in curriculum planning. He pointed out that “present 
results may be viewed, for purposes of research, as an inventory of prop- 
ositions to be investigated." The analysis of the disciplines is a rich and 
promising area for research that can weld together once again the scholar- 
ship of the university and that of the elementary and secondary schools. 

The studies reviewed in Chapter III give a new emphasis to the need for 
research to identify curriculum provisions that may raise the social status 
and aspirations of many American youths, Although existing research find- 
ings ofler too little guidance for elementary and secondary schools in meet- 
ing such problems as the dropout and the juvenile delinquent, the pro- 
fession seems to know more about the backgrounds of these problems than 
ever before and to exhibit more interest in dealing with them intelligently. 
Thus, one may well hope that continued and greatly increased attention will 
be given to the remediation and prevention of such social ills through 
deliberately conceived and experimentally tested educational programs. 

The authors of Chapter VIII reported a number of leads to new an 
promising research areas. For example, the strategy of curriculum change 
proposed by Taba (1962) has never been carefully tried out on an experi- 
mental basis. This inversion approach (the present reviewer prefers the 
term inductive approach) to beginning curriculum planning and develop- 
ment experimentally through specific teaching-learning units could well be 
the subject of relatively simple and extensive research. Th^ authors of 
Chapter VIII also offer a provocative suggestion about a new and promis- 

ing area by proposing that "educators might well explore the impact of the 
structure of the profession itself upon curriculum development and cur- 
riculum change.” Although such research might be highly complex, infor- 
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mation that could be secured concerning the relationship of the professional 
power structure to the educational program itself might lead to significant 
deviations from the traditional alignment of professional personnel and 
controls. 


New and Promising Research Procedures 


It is, of course, very difficult to judge research procedures on the basis 
of reading only the reviews presented in the preceding chapters of this 
issue. On this basis only, the reviewer is of the opinion that the only new 
and promising research procedures clearly identifiable are those with 
reference to the area of Chapter VI, “Teaching.” Great ingenuity was ex- 
hibited in a number of the research studies reported in this chapter. The 
interaction analysis used by Flanders (1961) to determine the degree and 
effects of teacher influence and items of teacher and student behavior seems 
of utmost significance in continued study of the teacher-pupil interaction 
element of the curriculum. Although the technique of analysis of verbatim 
records of classroom conversation is not new, the employment of this 
procedure by Hughes (1962) and by Smith (1961) as a basis for descrip- 
tion and classification of verbal behavior in the classroom seems to offer a 
procedure for continuing analysis and evaluation of classroom experiences 
of pupils. The work of Ryans (1960) in using various techniques for the 
study and evaluation of teacher characteristics in many situations is sug- 


i -oac specific school activities 
gestive of a team research approac f sp 


in a variety of situations. : . . 
TI upil ratings to differentiate effective and 
he use by Dalton (1962) of pup sj he preparation of these 


ineffective teachers and to analyze differences in the | rae 
teachers suggests a procedure of possible significance in the study of man) 


aspects of teacher backgrounds and performance. The more aii - 
of relatively new instruments in the analysis of classroom teaching an 


upil- i i ld also be noted. : 
P hee cia gemit made a very interesting € about 
the dissemination of educational research: some type of isseraination 
Service or agency that would function in a manner comparat e is m 
County agent in agriculture to help teachers and administra cy iem e the 
best research results available to the local school situation. bes haps, a 
true breakthrough in education could be greatly aided by establis! ing, in 
a large number of school systems, the position of research commision, 

The reviewer would like to call attention to the interesting and careful 


method usec by the authors of Chapter V in the preparation of their 
4 ds used in this chapter, such as selection of 


material. Application of standar | 
items for enum as well as the check by telephone conversations with 
Persons actively engaged in major research projects, would probably 
improve the quality of many chapters in future issues of the Review. 


h to studies o 
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A Theory of Curriculum Planning and Development 


The authors of Chapter I found great need for a theory of curriculum 
planning and development. They identified some efforts in this direction 
but, as yet, no significant formulation of a comprehensive theory. They 
concluded that the job is yet to be done and indicated stages and steps 
suggested by the publications reviewed. 

In the reviewer's judgmem, the authors of Chapter VIII noted the 
greatest current contribution to a theory of curriculum planning and devel- 
opment in their treatment of the “diffusion of innovations." They demon- 
strated their critical approach to this matter through clear-cut criticisms of 
the limitations of the Brickell (1961) report. At the same time, they 
recognized the broader view of diffusion research taken by Rogers (1962) 
and also the comments of Mackenzie and Huebner (1962) on the political 
qualities of curriculum change. Such studies as these indicate the tremen- 
dous possibilities of more rapid and comprehensive utilization of significant 
research on the curriculum once the latter is available. . 

The authors of Chapter VIII also presented reviews of some of the signif- 
icant work done in the studies of roles, role perception, and personality 
structure. Although one might expect to find some of these studies as wel 
as those of creativity reviewed in Chapter IV (which the present reviewer 
did not have opportunity to read in the preparation of Chapter X), never- 
theless, such work as that of Flanders (1960) and of Bills (1963) offers 
significant insights into the operation of curriculum planning organiza- 
tions. Certainly, the ways in which the individuals concerned view them- 
selves and their colleagues must occupy a central place in any theory 9 
curriculum planning and development. 


Looking Ahead 


As indicated in the introduction to this chapter, the reviewer is well 
aware of the paucity of significant, completed curriculum research studies. 
He quite agrees with the authors of Chapter V that “too few studies are 
replicated" and that "too few studies have an end beyond themselves." He 
is impressed, too, with the final recommendation of these authors: “This 
journal could perform a vital service by rigorously selecting fewer studies 
for review, by analyzing each study in depth, and by candidly reporting 
its strengths and weaknesses." N 

This final chapter, however, has attempted to indicate tne reviewer 5 
judgment of some of the promising beginnings of significant research and 
the development of critical areas and ideas for exploration. His optimism 
is most aroused by a seeming ground swell of interest in, even beginning? 
of, long-term, carefully conducted, curriculum-oriented research studies. 
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FOREWORD 


Up to June 1958, research in teacher personnel had been reviewed at 
three-year intervals. The present issue of the Review is concerned with 
research in the same topic over a five-year period. 

With respect to the studies of teacher personnel, chairmen of the June 
1955 and June 1958 issues of the Review agreed that there was little good 
research to be reviewed and that there was need for further study and 
for development of new techniques. To a large extent, this is still true. 

Some portions of the subject studied did not lend themselves to re- 
search other than the descriptive or normative types. Other portions, how- 
ever, proved to be sources of both intensive and extensive research during 
the past five years. In two or three instances, the amount and quality of 
research was more than could be reviewed adequately, due to limitations 
of space and time. For this reason, a return to the three-year cycle is 
suggested. 

The next committee to review research on this topic may wish to under- 
take some internal reorganization of the subject. From what has been 
discovered by the present chapter authors, it seems likely that the contents 
of each of two of the chapters might well have been organized as four 
separate chapters. On the other hand, certain other chapters might well 
have been combined into one. Unfortunately, the quality and volume of 
research did not reveal themselves until work was well under way; at which 
time it was too late to attempt major reorganizalion. 


Howarp A. Bowman, Chairman, 
Committee on Teacher Personne 
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CHAPTER I 


Teacher Certification, Supply, and Demand 


JAMES C. STONE 


E 
o 


Treacuer certification, supply, and demand are so interrelated that sepa- 


ration for more than descriptive purposes tends to neglect the most 
important aspects of analysis. The pervading theme throughout the litera- 
ture reviewed attests to the need for an adequate supply of fully qualified 
personnel in the public schools. The issue is compounded, however, for 
there is no precise agreement on the meanings of any of the terms cited. 
although the National Commission on Teacher Education and Professional 
Standards (NCTEPS) and other units of the National Education Associa- 
tion have been working for common practices and understandings. 


Certification 


e states actually have been 


r certification in som 
s, according to Hoden- 


Requirements for regula 


lower than those for emergency certification in other: 
field and Stinnett (1961). This has been a major problem of the whole 


situation of certification in the United States. If the inequalities between 
emergency and standard certificates emerge as a problem, that of the 
Comparability of standard certificates among states 15 one of truly signifi- 


cant proportions. 


State-by-State Variations 


Armstrong and Stinnett (1962): Hodenfield and Stinnett (1961), and 
d fic differences in standard cer- 


Woellner and Wood (1962) reported speci : 
Uficates for elementary and secondary teachers among different states. 


rizona, California, and the District of Columbia require a master's degree 
Or 30 semester hours of postgraduate work for a high school certificate: 
Onnecticut, Indiana, Maryland, New York, Oregon, Pennsylvania, and 

ashington require a fifth year of preparation for full certification for 
Secondary school teachers (Armstrong and Stinnett, 1962). Arkansas is 

e only state not requiring à bachelor's degree for secondary teaching, 
although it is planned as a stale requirement sometime in 1963. The 
addition of Arkansas in 1963 will represent a three-state gain since Herge 
(1958) completed his review of certification requirements. 


b oellner and Wood (1962) reported that in only six states the 
achelor’s degree is not a minimal requirement for regular certification 
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of elementary school teachers. Since 1958, seven states—Alaska, Colorado. 
Iowa, Kansas, Kentucky, Minnesota, and Missouri—have changed to a 
minimum standard of a bachelor's degree for elementary teachers. Those 
states not requiring this minimum—Maine, Montana, Nebraska, North 
Dakota, South Dakota, and Wisconsin—tend to be somewhat sparsely 
populated with smaller and more isolated school districts (Armstrong 
and Stinnett, 1962). 

The trend toward the requirement of a bachelor’s degree also was 
recognized within the professional organizations of at least eight states 
by the end of 1961 (Armstrong and Stinnett, 1962). One anticipates that 
this trend will continue at the state level as more states develop higher 
standards for full certification. 


Variations in Certificates 


Within the framework of eligibility for standard certification, other 
major points of divergence among states pertain to the nature of the 
specific requirements in professional education, subject matter, specializa- 
tion, general education, citizenship, special courses, and loyalty oaths. 
In discussing several of the inequities among states, Barnes and Shipman 
(1961) found that the requirements in professional education for ele- 
mentary school teachers in 1960 ranged from 16 semester hours in three 
states to 36 semester hours in one state and that for secondary teachers 
the same kind of requirements varied from 12 semester hours in two 
states to 27 semester hours in one state. Moreover, the number of hours 
required for student teaching fell between zero and 10 semester hours. 

Barnes and Shipman also concluded that general education requirements; 
which are not specified in most states, continue to receive less attention 
than areas of subject matter specialization. They also noted that require- 
ments in the specialized subject matter of the applied arts and sciences 
were higher in number than were the subject matter requirements in the 
traditional academic areas in high school. In a corresponding study, Tuttle 
(1963) determined that the applied fields were requiring two to three times 
as much preparation as the traditional fields. He urged that the differences 
in requirements be narrowed. Of a related nature was Armstrong an 
Stinnett’s (1962) report that the most significant recent change in certifi- 
cation has been the general upgrading by the states of the subject matter 
requirements in the traditional academic areas. 


Levels of Responsibility 


State boards of education are the most common agencies for the issuance 
of certificates, according to Armstrong and Stinnett (1962), Barnes an 
Shipman (1961), Latimer (1963), and Woellner and Wood (1962). 
Although Barrett (1961) found that centralization of authority for certifica- 
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ES oe State Board of Education was associated with the 
standards increasingly being demanded of the state’s teaching 
personnel, Lindsey (1961) supported the thesis that the profession itself 
must take more responsibility for the enforcement and administration of 
the licensing system than it has taken in the past. Evidence that this point 
of view is gaining momentum may be noted in the writings of Allen (1961), 
Barnes and Shipmar (1961), Bush (1961), Corey (1961), Davies (1962), 
Latimer (1963), and Yeager (1960). ° 
One of the most pronounced indications o 
responsibility being placed upon the professio; 
approved program approach for meeting certification requirements. It is 
through this approach that NCTEPS (Lindsey, 1961) has attempted to 
establish guidelines and standards which provide an increasing degree of 
reciprocity among the states. Barnes and Shipman (1961) reported a 
Northeast 11-state compact for reciprocity and a Midwestern agreement 
that provides freedom of movement across state boundaries with minimal 


difficulty. Davies (1962) mentioned that 27 states have agreed to accept 
proved by the National 


for certification any graduate of an institution ap 

Council for Accreditation of Teacher Education (NCATE). Lander (1963) 
noted that approximately 400 of the 1,100 colleges and universities offering 
teacher education programs have the endorsement of NCATE and that more 
than one-half of these institutions have entered into reciprocal agreements 


for certification of teachers on the approved program þasis. Latimer (1963) 
declared that complete reciprocity in certification of teachers is present in 
20 states, Authors who reported upon approved programs were quick to 
point to the added responsibility that such a process placed on the institu- 
tions for careful selection and screening for teaching credentials. 


f the increasing amount of 
n has been through the 


T P” 
eacher Examinations 


ussed various certification approaches that 


A numb i i 

ES er of writers disc s 4 

utilize some form of examination of teaching competency. Davies (1962) 
dent of the preparing institution and of 


Fana for examinations independent 
e employing school district. Latimer (19 


Certification from subject field to subject fiel : 
9n an examination that reflects the cooperative planning of professional 


Organizations and learned societies. He also would restrict courses from 


Peing offered by local schools unless a qualified teacher was available to 
9 the teachi mim lling attention to the West Virginia plan of using 
ing. After calling ledge of the teaching field, 


€ National Teacher Examinations to test knowle 9 
tiles (1959) reported a consideration by the Wisconsin State Department 


cà Public Instruction to use written examinations for potential teachers. 

lese examinations would furnish measures concerning breadth of liberal 
arts background, knowledge in specific subject fields, and familiarity with 
Pjectives in professional education courses. Taking a position different 
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from that of Davies (1962), Stiles urged that the institution in teacher 
education maintain the responsibility for administering the examination 
and for interpreting the results—an activity that would furnish one basis 
of recommendation for certification. Yeager (1960) suggested an *extra- 
official, professional board," which would be closely allied, but not tied. 
to the state certification agency as the proper examining and recommend- 
ing agency. The board would represent the profession in certification pro- 
cedures. Greene (1961) also called for close cooperation of the profession 
and the state. 

In noting that 22 states made use of some sort of examination for 
meeting various state certification requirements, Slaughter (1961) believed 
that the apparent trend in recent years toward the use of examinations must 
be the result of an attempt by the states to make certification procedures 
more flexible than they have been in the past. 

Along with increased use of examinations has come a decrease in the 
number of states requiring special courses on a state or regional basis. 
However, Armstrong and Stinnett (1962) pointed out that there are still 
a number of states which require U.S. citizenship, a loyalty oath, a general 
health certificate, a chest X ray, or special courses for certification require- 
ments. Woellner and Wood (1962) reported that there were 579 separate- 
name certificates issued by the states in 1961 and that the number per 
state ranged from 2 in three states to 68 in one state. This represents a 
decrease in the total number from 630 in 1959, as noted by Hodenfield 
and Stinnett (1961). Hodenfield and Stinnett also stated that the most 
common certificate issued is one that carries an endorsement indicating 
the field or area of specialty. 


Trends 


Many authors attempted to spot trends in certification of teachers. 
Although more states than formerly are accepting NCATE-approved pro- 
grams as a basis for certification, the United States is still a long way from 
national certification (Hodenfield and Stinnett, 1961). Lieberman (1961) 
outlined the advantages of national certification and concluded that basic 
changes in the approach to certification would be necessary before such a 
procedure was possible. Voicing a view of NCTEPS, Lindsey (1961) 
reinforced the program plan approved by NCATE as a necessary aspect 
of professionalization and as a standard of certification on a national level. 

Armstrong and Stinnett (1962) reported that all states requi-e certifica- 
tion of all public school personnel who have specific responsibilities related 
to pupils. Included are administrators, nurses, guidance counselors, Psy 
chologists. and social workers. Within the general requirement that ies 
personnel be certified, the National Education Association, Rem 
Division (1962a) estimated that 91,556 teachers were employed on the 
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basis of emergency certificates in the 1962-63 school year. This figure 
indicated that 1 teacher in 16 was serving on a substandard certificate. It 
was also estimated that approximately 70 percent of these emergency- 
certified teachers were employed in the elementary schools and 42 percent 
in rural schools. Approximately 47 percent did not have at least four years 
of college preparation. This fact takes on significance when one considers 
that six states do not require a bachelor’s degree for full certification for 
elementary school teachers. The National Education Association, Research 
Division (1963b) reported that 6.3 percent of the elementary and. 4.3 
percent of the high school teachers had less than full teaching qualifications 


for their states. i 
The supply of qualified teachers for public schools in the United States 


remains the foremost problem in public coe today. Ge pese: 
Educati ssociati arch Division (1963a) reported a total o 
cU seme dh mani hools for the 1962-63 school 


1,509,281 classroom teachers in the public sc 
year. The Division also estimated a net shortage of 134,000 teachers for 


this same period. 


Supply and Demand 


Maul (1960b) called attention to the factors both within and without 


y d demand 
u ional sy. ^e had an effect on the supply of an 
1e educational system that have h rue of several external forces, hs 


or qualified teachers. To give ex a : 
pointed to increased birth satis mobility of the SL bogie os 
of young families in such temporary locations as areas "S a g P dd» 
or military installations, and desires of logal hail d the te 
education over and above the state minimum. Maul also liste $ 

h tend to contribute to 


3 hick 
Conditions ‘ch ste for teaching, whic r 
“oly high standards as opposed to the low retention rate of 


stability and mini ver. ; 
a minimum turnover, f het 
Communities with relatively low standards. He went re qu v 
economie factors, rather than just the number of qualifie r Ue 
Benera] poridatios vhs are capable of teaching. - cc: the most influ- 
ial i : ; em. 
€ntial condition underlying the supply-demand pro 


Economic F actors 


; i iderations to the 
i ; of other economic consi : 
ply e e aiala d in a large number of studies. Fox 


“ppl d bstant f 
(1961) oe Hills and Downs (1962), Lambert (1963), Maul 
(1961) : M Ken ( d Kershaw (1962), the National Education Association, 
bg 960), Seltzer (1962), Stone 


ivisi ichards (1 
O61) A Division (19628). SER Pelt (1958), and Walker (1958) 


(1961), Tyler and Miller (1962 
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provided data that supported Maul's emphasis on economic factors as 
important determinants of the supply-demand ratio. 


Subject and Grade Level Factors 


The demand for teachers is one not only of numbers but also of distri- 
bution of those who prepare fur teaching. Maul (1960a, b, c, 1961). 
McKean and Kershaw (1962), the National Education Association, Re- 
search Division (19602, b, 1962b, c), Osborn and Maul (1961), and Stone 
(1961) referred to the need for an improved balance between subject 
matter fields in greatest demand and those consistently oversupplied. 

Bartels (1960) explained the Index of Teacher Demand for 1958 as the 
ratio of the number of new teachers employed who had not taught during 
the previous year to the number of teachers newly prepared by colleges 
and universities. It is expressed as the percentage which demand is to 
supply. Although Bartels presented information on only 29 states and 
2 territories, his findings proved interesting. The Index for the high school 
was 83 percent; for the elementary school, 188 percent. 

The National Education Association, Research Division (1960a, b. 
1962b, c) has reported a continuing condition of eight new elementary 
positions available for every five high school positions; the ratio of supply 
has been the reverse. An encouraging trend in the differential between the 
numbers of elementary and secondary school candidates who prepared 
for teaching between 1950 and 1955 revealed an uncertain reversal during 
the years between 1955 and 1960. The National Education Association, 
Research Division (1962b, c) reported an increment of 13.9 percent of 
the new prospective high school teachers as opposed to an increase of 
5.1 percent for new prospective elementary teachers over the previous year. 
This circumstance did not decrease the need for teachers in the elementary 
schools. Although only 68 percent of the high school and 83 percent of 
the elementary school trainees who prepared for teaching each year actually 
entered teaching, the difficulties of supply have become increasingly acute 
in certain fields of high school teaching. 

A gross example of oversupply was in the area of men's physical educa- 
tion. Of the 7,700 graduates with major preparation in this field, fewer 
than one-quarter obtained full assignments in physical education; of the 
total graduates in physical education who obtained positions, more than 
17 percent were teaching outside of their major area of preparation. Social 
science was next in the order in wh:ch supply exceeded demand. Home- 
making education and women's physical education were at the other end 
of the scale, with demand far exceeding supply. The physical sciences and 
mathematics were also fields of continuing extreme shortage. These supply- 
demand ratios at the high school leve] were almost lost in the tremendous 
need for elementary school teachers, There appears to be little doubt that 
the area of greatest need will continue to be in the elementary field. 
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Other Factors 


E | In addition to the economic factors, there have been other conditions 
that contribute heavily to the deficit in supply. The continuing increase 
of children of elementary and high school age shows few signs of abating. 

The National Education Association, Research Division (1963a), which 
reported 20 percent of the U.S. population to be in the school age bracket 

(between 5 and 17 years), estimated thát by 1970 the population in this 

age group will be at least 6 million more than it was in 1950. It also was 

estimated that the same school age group will constitute 25 percent of the 
| general population. Clearly, the need for teachers will increase. 

Folsom (1958), Harris (1957). and Walker (1958) called attention to 
the indifferent or negative attitudes of some of the public toward teachers 
and emphasized the detrimental effect such attitudes have on keeping 
teachers in the classroom. Fox (1961), Maul (19602) , the National Educa- 
tion Association, Research Division (1960a, b, 1962b, c), Osborn and 
Maul (1961), Stone (1961), and Van Pelt (1958) called for efforts in 
counseling youth about the specific areas of need in teaching. Hills and 
Downs (1962) pointed out that the profession has failed to offer new 
teachers opportunities for advancement that are commensurate with their 
ability level. This shortcoming well may discourage many college students 


from entering teaching. 

Maul (1960b) reported that a 
turnover but also exerts a stabiliz 
describing factors that contribute 
and Oklahoma schools, respectively; 
also found that high standards, good salaries, and 
tended to go together. 

The differences in state c 
Working conditions also have an @ 


demand of teachers. The National Educatior 
(1962c) pelea T percent of the graduates during the 1960-61 school 


year who were certified for teaching took jobs outside of the state in which 
they had prepared. This figure. compared to 14.9 percent, cited two years 


: s i iation, Research Division 
previousl National Education Association, R 
y By the ht of the teachers in the group. This 


(1960b) , accounted for one out of eig : h 
report also drew attention to the fact that states which had a high percentage 
of college graduates among the total elementary school corps of teachers 
were able to make further progress in recruiting highly qualified teachers, 
Who, in turn, would tend to raise the total state average. On the other hand, 
those states with total elementary staffs at the lower levels of preparation 

th less than the national average 


tended to be inducting new teachers with 1 a 
amount of preparation. This underscores an important trend—the widening 
of the gap between states with higher standards and those with lower 


Standards. Coupled with this trend is the probability that the quality 
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teacher in a low standard state can find greater economic rewards and 
better teaching conditions outside the state in which he was educated. 

Both the National Education Association, Research Division (1962c) 
and Stone (1961) reported that approximately one-half of all persons 
certified each year in California have been prepared in out-of-state insti- 
tutions. Stone (1961) also reported that California colleges and universities 
that were accredited to prepare teachers supplied only 52 percent of the 
total number of new teachers needed in 1960. 

The mobility of teachers is a prime factor in the inability of researchers 
to assemble precise data on the supply and demand of teachers. Hodenfield 
and Stinnett (1961) reported on the difficulties teacher mobility brings to 
standardization in certification—a mobility which challenges the state 
agencies responsible for yearly estimates of supply and demand. The general 
movement of the population to the West and the Southwest and from non- 
metropolitan to metropolitan areas, especially the suburbs, was reported 
by the National Education Association, Research Division (1961). Tyler 
and Miller (1962) called attention to the special need for teachers in 
growing urban areas. 

Stone (1961) reported that one-third of the teachers in California had 
been educated outside the state. One-third had taught in other states, and 
only one-half had taught in only one California school district. He noted 
that a slightly higher percentage of high school teachers than of elementary 
teachers remained in one district. Somewhat disconcerting was the finding 
that 27 percent of the candidates trained in and certified by California 
failed to enter teaching. : 

Osborn and Maul (1961) noted that approximately 61 percent of the 
elementary school teachers prepared in Iowa remained to teach in that 
state and that approximately 28 percent went into elementary schools in 
other states. The remaining new teachers did not enter teaching. Approxi- 
mately 27 percent of the candidates who prepared for the high school 
level did not enter teaching. and only 53 percent of those trained remained 
in Iowa. 


Remedies for the Shortage 


Several remedies for the shortage of teachers have been suggested. 
Folsom (1958) suggested retaining more of the qualified teachers already 
prepared. He stated that. if the profession could extend the average teaching 
career by two years. the shortage practically would be eliminated. With 
marriage and maternity reported by Harris (1957) and by Stone (1961) 
to be the two major causes for teachers leaving the professica, there is 
little immediate hope for Folsom’s suggestion answering the need. 

As implied previously, substantial increases in compensation would " 
the major step toward solving the shortage. McKean and Kershaw wae 
suggested that, since the shortages in teaching are specific, T one. = 
approached through special salary incentives which would be offe 
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fanse falls pim shortage. They challenged the across-the-board salary 
mamie ile as highly expensive and, in fact, declared it a contributor to the 
ntinued disproportionate supply of teachers. 
ren studies as those by Fox (1961), Richards (1960), Tink (1960), 
al Van Pelt (1958) attested to the importance of teacher influence on 
college students who entered teacher education programs. The tendency was 
for the elementary school teachers of these college students to have had the 


most marked influence. d 


m (1962) indicated that the high r 
school teachers might be influenced by providing a subject matter major 


» the elementary school teacher education program. This was based on 
fae finding that the interest in a particular subject field was an important 
actor in the choice of prospective teachers who entered secondary education 


programs. 


. Phi Delta Kappan (1963. p. * 
50,000 men and women will retire annually from military service. Nearly 


one-half of these persons have bachelor's degrees. Moreover, two-thirds 
of the retiring officers have expressed an interest in education as a second 
career. Shriver (1963) stated that almost. two-thirds of the 5,000 Peace 
Corps volunteers are working in education fields. He felt that the eagerness 
of these people to enter teaching might be diminished by certification 
requirements. He also indicated that various universities have plans to 


furnish Peace Corps members with educational and certification credit. 


Phi Delta Kappan (1963, p. 355), which reported a poll of the first 700 
returning Peace Corps members, revealed that approximately 25 percent 


looked forward to teaching careers. 
The solution to the quantity problem of supply 


Sight for some states. Stone (1961) projected t 
teachers to surpass demand by the academic year 1970-71. This look to 


the future does not, however, take into account the need for replacing 
teachers who currently are serving on substandard credentials. Quality 
teachers appear to be in demand for some years in the future. Phi Delta 
Kappan (1963, p. 353) reported an interpretation by Coulter of national 
figures. in which he also visualized an easing of the teacher shortage by 
1970. The interpretation Was based on the fact that between 1960 and 1970 
^n increase in enrollment of 24, percent and an increase of 79 percent in 
the number of college graduates will take place. Coulter contended that 
the college ssudueke is the primary source of supply for the teacher 
Shortage "and. shat thè increased number of graduates alone will be the 


e 
ief factor in reducing the shortage- 


atio of high school to elementary 


5) reported that in the next decade 


and demand appears in 
he California supply of 


Needed Research 


called attention to the lack of research 
Other areas of needed research 
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include the relationship of certification to the quality of teacher competence, 
the instrumental value of certification in the control of teacher education, 
and the sociological implications of national standards of competence and 
certification. The problems of supply and demand leave many questions 
unanswered. Research is needed on how to find out the information which 
is required for making accurate prognoses concerning fields of teacher 
demand and means of meeting them. The mobility both of the nation's 
population and of teachers suggests the need for improved methods of 
locating teachers and of keeping in touch with them over a period of years. 
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CHAPTER II 


Recruitment and Selection of Prospective 
Elementary and Secondary 
School Teachers 


GLENN W. DURFLINGER 


Ox: or the best statements of the point of view in the teaching profession 


regarding the recruitment, selection, and guidance of prospective new 
members is Chapter VII of New Horizons for the Teaching Profession, 
a report of a project sponsored by the National Commission on Teacher 
Education and Professional Standards (Lindsey, 1961). Titled “Tdentifi- 
cation, Selective Admission, and Retention in Teacher Education,” the 
chapter reflects the thesis that it is the responsibility of the teaching pro- 
fession to determine what is required in training a successful teacher in 
terms of basic student characteristics before admission to the program, 
In recruiting and selecting candidates on the basis of this knowledge, and 
In exerting reasonable efforts to retain successful teachers within the active 
Professional ranks. This point of view sets the stage for the research 


reviewed in this chapter of the REVIEW. 


Recruitment 


The most important problem pertaining to recruitment is obtaining 
enough qualified teachers for the ever-increasing multitude of children in 


the nation’s schools. School populations are expected to increase by approxi- 
mately 1 million each year for at least a decade. This increase requires 
an annual addition of nearly 50,000 teachers. Since approximately 150,000 


teachers retire from or leave the profession each year, a constant annual 
s exists. Furthermore, there are 90,000- 


need for 200.00 age sher 

.000 additional teacher $ S i 
00,000 teachers with substandard teaching credentials—teachers who 
need to acquire additional training 3% soon as possible. Research on recruit- 


me: " 2 

nt problems is desperately needed. . ist 

One of the most comprehensive surveys of recruitment was mace Dy 
najor state and national organizations 


age (195 "ted on the m 
or ee lacia study involved a state-by-state analysis of 
the efforts expended by the National Congress of Parents and Teachers, the 
tudent National Education ‘Association, the Future Teachers of America 
"TA), and other facilities to encourage young people to prepare for 
and to enter the teaching profession- Financial aid programs available 
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to students were also listed. Such information as was revealed in this 
survey should be in the hands of high school and college counselors. In 
view of the urgent need in the nation's schools for reduction in the 
number of overcrowded classrooms, of restricted programs, and of limited 
curriculums, a survey such as that by Page should be conducted regularly 
to reveal the extent of progress in recruitment efforts. 


Factors in Decision Making 


A number of studies sought answers to questions tangentially related 
to recruitment. In a questionnaire study, Fox (1961) sought the factors 
underlying the decision to become a teacher. The following characteristics 
were found to be significantly related to the decision: (a) desire to work 
with children or adolescents, (b) liking for a particular subject. (c) 
opportunity to leave the profession and to return to it later, (d) increasing 
salary trend, (e) scores on inventories of vocational interest, (f) possibility 
of using teaching as a steppingstone to another career, and (g) membership 
in an FTA group. Factors b, d, e, and f were associated more strongly with 
the decisions of the men than with those of the women; and factors a, c, a^ 
g were more highly related to the decisions of the women than to those of 
the men. 

Tink (1960) studied the factors related to the choice of teaching as ? 
vocation. The responses from 914 high school students, 399 college 
students, and 132 teachers indicated that four influential factors in 
choosing teaching were (a) teaching is interesting and satisfying; (b) it 
is challenging and imaginative; (c) it offers opportunities for working 
with a favorite academic subject; and (d) it affords an enjoyable working 
relationship with pupils. It also was noted that experiencing harmonious 
relationships with people was very important. Reasons for not choosing 
teaching were (a) having a different vocational interest, (b) perceiving 
existence of unfavorable teaching conditions, and (c) feeling persona 
inadequacies. 


Age at Making Decision 


Tink (1960) found that women decided earlier than men to enter 
teaching. This conclusion was also reached by Fox (1961) and by Mason 
(1961) in a study at the U.S. Office of Education. There was only a 
slight discrepancy among the studies concerning the age at which the 
decision to enter teaching had been made. Tink also reported that one- 
third of the 389 college students in his sample decided to be teachers 
before they entered the tenth grade, The U.S. Office of Education survey 
revealed that 21 percent of the women and 7 percent of the men T 
on a teaching career before the eleventh grade (Mason, 1961). Fox (19 
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) i rum that about 80 percent of the women in preparation programs for 
of | 15 as well as a high proportion of prospective elementary teachers 
zs Poth sexes had decided favorably about teaching before high school 

d Paati whereas only about 30 percent of the men and 50 percent of 
hei prospective candidates for secondary school teaching had done so 
before high school graduation. 

In a questionnaire.study at the Ohio State University, Riccio (1961) 

Of the 488 freshmen entering the 

age ages at which the male and 

respectively, 17.05 years 


found somewhat the same results. 
"mas preparation program, the aver 
emale candidates had made their decision were, 
and 15.08 years. 
FP ae studies revealed that individuals sampled perceived that teaching 
t ers the opportunity to leave the profession and to return to it later. Fox 
belii found that factor to be related statistically to decision making. In 
p a Office of Education study by Mason (1961), more than four out of 
ive of the women in their first year of teaching said that they would want 
"erm d from homemaking to teaching at some future time. The oppor- 
5 My in teaching to take time off for homemaking and for children is 
potentially important in recruitment. 
Piva ied (1962) studied graduates of the University of Georgia to Ms 
f 1 programs most often led toward teaching. Students with a bac elor 
of science in education and those with majors in business education, 
elementary education, and physical education more often went into 
teaching than those with majors in agricultural education, English, or 
Social sciences, Men were more likely than women, and younger students 
Were more likely than older students, to change their minds about entering 
teaching, He also noted that the minimum entrance requirements for the 

niversity of Georgia served to select students with sufficiently high 
academic ability to teach successfully. The secondary school background 
and the freshman placement examination were not definitive factors 


relativ A 
ative to who would enter teaching 


n Dutton and Keislar (1961) determined tha s 1 
9t possess a significant amount of correct information concerning the 
signific 


teaching profession. They also found that boys thought teaching would 
€ less remunerative than did girls. Students from an upper socioeconomic 


Stoup believed that their parents would be unhappy about their choice 
h students from the middle and lower 


so aching a althoug! 
a c ioeconomic groups felt ae their parents would be pleased. The authors 
ad aded that attitudes toward teaching were formed long before high 
| 90l graduation. 
^ Ne has investigation of the 
entering’ Hills and Downs (1962 
opporte the teaching profession bec 
the 9 nity for advancement—a cone 
ne reached in the study by Dutton 


d that high school students did 


attitudes of college students toward 
) concluded that men were opposed to 
ted income and restricted 


ause of limi 
Jusion essentially in agreement wilh 


and Keislar (1961). 
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Selection 


In the area of selection of students to pursue a teacher preparation | 
program, research efforts either must rely upon previous investigations in 
the measurement and prediction of teacher effectiveness or must make 
some unique contribution to knowledge in this arca. , 

In the follow-up of Stout’s (1957a, b) study of selection procedures, 
Edson and Davies (1960) attempted to obtain information concerning 
both nature and influences of changes taking place in selection procedures 
between 1953 and 1958 in Minnesota colleges and universities. They 
found that there was an increase in the use of records and measures of 
academic achievement and in employment of indexes of competence in 
English usage as bases for selection. There was also an increase in the 
use of faculty committees in the selection of students. Colleges with 
relatively low or moderate selectivity standards were sending more teachers 
into the schools than were the colleges with higher standards. 

Rutherford (1961) conducted a questionnaire study to determine current 
practices in selective admission and in selective retention in certain teachers 
colleges. Altenhein (1962) described the guidance and selection program 
at Hunter College. Although there was little systematic inquiry into the 
characteristics of the total program in teacher training. considerable 
research was undertaken concerning elements or details of student char- 
acteristics in the selecting process. 

One such study at the University of Southern California by Michael 
and others (1961) indicated some of the types of instruments employe 
in the selection process as well as the results of their use. They found that 
success in a teacher preparation program could be predicted to the degre 
that about 20-25 percent of the variance in the criterion variable was 
explained. The battery of tests consisted of the social studies, literature. 
science, fine arts, and mathematics subtests of the Cooperative General 
Culture Test; an English fundamentals test developed at the University + 
and the reading vocabulary, reading comprehension, arithmetic reasoning: 
and arithmetic fundamentals parts of the California Achievement Tests. 
When the results of a scale called Rating Sheet for Directed Teaching- 
the grade point averages in methods courses, the ages of students, an: 
their classifications in college were added to the other measures cited. 
the percentage of predicted variance rose to above 40. 


a 
Criteria of Selection | 
^ 
1 H m ing 
In a questionnaire study of 48 universities and 132 colleges gar sh 
the type and place in the students’ program of the use of selective = n 
Magee (1961) found that most institutions selected students (a) on 
hegi r division and (b) at entrance into student teaching 


beginning of the uppe 
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5 
rtp na of the institutions imposed some form of official hurdle 
"iiie upper ivision level, and 55.6 percent required formal admittance to 
P ivision professional courses. A grade of C was generally adequate 
to professional courses, and only 3.3 


5 de: 
for admission to upper division or 
andard of 2.5 or greater. Only 35 


percent reported a grade point st 
uu. Janandes a grade point average higher than 2.0. Moreover, 
ronda ee which were applied primarily to secondary teaching 
cd es, were imposed variously upon (a) total work, (b) professional 
ses. and (c) major courses. 
co his research to 97 elementary school student teachers, Darrow 
wale mas a multiple R of .33 between the criterion of a teacher rating 
(MTATY the combination of the Minnesota Teacher Attitude Inventory 
Educa ), lowa Tests of Educational Development, American Council on 
fiain E Psychological Examination for College Freshmen, total college 
our ratio, and high school grade average. He recommended con- 


li e ¢ 0 
nued use of the first three of these measures in the selection of prospective 
of the composite of 


e candidates. Although the validity coefficient 1 
rss was not high enough to convince one that the instruments 
b ployed were the most effective ones. the relatively low R might actually 
" the result of the use of a teacher rating scale of low reliability and of 
questionable validity. Rabinowitz and Mitzel (1961) contended that, if 
ie predictors employed resulted in a multiple us mg her 
tows, 0 with a reliable measure of teaching success, one wes screening out 

© many potentially successful teachers and leaving in too many unfit ones. 
( nother study with somewhat negative results was that of Robinson 
1962), On the basis of the criterion of g g, he found 


l going into teachin 
that the predictive validity of such instruments as the MTAI. the Guilford- 
Zin ) 

"merman Temperament Survey. and t 


he Nelson-Denny Reading Test 
Was 1 KENT 
$ so low that he recommended against their use in 


accepting or rejecting 
tudents for the teacher training program. Fielder (1960) obtained similar 
terion, 


r i 1 : ne 
results when he employed this same cri i.e., entry or nonentry into 
: assroom teaching He also employed the criterion of continuing or nol 
continuing : io " Meo 

Muing in the teacher preparation program. 


A " 

tademic Competence 
the academic quality of prospective 
bnick (1961) summarized several 
clusive. Using the 
School Content 
ination, Snider 


Nes studies were concerned wis 
types s found in colleges. Mitzel and Du 4 
es of evidence which they declared to be incon 
lio: State University Psychological Test. Iowa. High 
and i"4ion, and a locally constructed mathematics exam 1. Sni 
phar Ong (1961) compared majors in business education. peed 
Macy, and education. Prospective teachers did as well as the business 
en majors and the engineering majors. but did not reach the level 

à 1 The authors suggested that, since 


t 
© pharmacy students in these tests. 
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measures of academic aptitude were available, they would be employed as 
the chief or only screening device unless more valid ones should be 
developed which would measure other characteristics. Cummings (1960) 
reported that students in the College of Education at the University of 
Alabama did better on tests of intelligence than did nonvocationally 
oriented students. However, the female sex far outnumbered the male sex 
in numbers in this college and in other vocational!y oriented programs. 
Moreover, the women did son.ewhat better than the men did on the tests 
of intelligence at this university. ` 

On the other hand, Vertein (1961) studied a group of students from 
Wisconsin State College and Institute of Technology at Platteville who were 
preparing to be teachers and found them to have an average academic 
aptitude below that of the students not planning to teach. Both groups were 
below the University of Wisconsin academic average. However, most of 
them came from a restricted geographic locale, namely, southwestern and 
rural Wisconsin. 

There is a considerable list of concomitant sociological factors of current 
professional interest and significance to the topic of the academic or 
intellectual aptitude of teachers in training. An important study by Statler 
(1959) included 257 students at the Ohio State University who were 
beginning their professional course sequence. Nearly 60 percent of the 
fathers were employed in business, in production, or in agriculture. Half 
of their families had annual incomes below $7,500—a figure which was 
2.7 below the national average. Fifty-six percent were graduated from @ 
high school of 150 students or fewer, Eighty-five percent of the men anc 
55 percent of the women transferred from another major into a teaching 
major. Constituting 38 percent of the total sample, men preferred to teach 
in science or mathematics; women preferred language or social studies- 

In short, there was inconclusive and conflicting evidence concerning 
whether the students who were preparing to be teachers or who were hopefu 
of becoming teachers were of lower academic aptitude than were either the 
students of universities in general or the students preparing for other fields 
of life work. Moreover, the data were confused by other correlated factor? 
such as socioeconomic background of candidates and future economic 
prospects after entering the profession. h 

Making usc of several college groups including those preparing to teac™ 
Van Wagenen (1961) attempted to determine the relationship henen 
academic achievement and personality factors revealed by the Edwart" 
Personal Preference Schedule (EPPS) , the Guilford-Zimmerman Tempera 
ment Survey (GZTS), the Strong Vocational Interest Blank, a scale gcn 
cerning rigidity of attitudes regarding personal habits, a questionar 
about personal data and background, an anxiety scale, and an pend : 
beliefs. For the most part, his data indicated that none of the — : 
these instruments was consistently related to grade point pda n cd io 

less of the college group considered. Those planning to teac 
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t rank higher i iabili iendli 
se higher in ascendance, sociability, and friendliness in the GZTS than 
id students in other groups. 


Career Commitment and Selection 


Ns individual preparing for a career in teaching spends considerable 
E ii and time, and the community invests heavily in education. As 
ssi previously, one of the important arguments to be employed in 
— is the possibility of a teacher's leaving a position in the 
ay , or an indefinite period of time with the opportunity of returning 
facto ater date. This significant recruitment inducement is possibly a 
"e r related to mortality in the teaching profession which may be 
is ponsible for some of the teacher shortage. Presumably it would be 
passible or at least inadvisable to require adequately prepared teachers 
vith credentials to remain in the profession. However, if selective tech- 
"t could be developed that would pick out the one teacher who 
uld remain on the job for 40 years, the time spent in training and 
Supervision as well as untold amounts of money could be saved. 

There are few studies in the recruitment and selection of teachers which 
are concerned with turnover and termination—either permanent or tem- 
Porary, An investigation by Murphy (1962) has already been mentioned 
under Recruitment. Another study, by Rabinowitz and Crawford (1960), 

ealt with the relationship between predictor variables obtained during 
Student teaching and subsequent persistence in the profession. They found 
Mat 40 percent of the original group of 1,144 graduates responding to the 
questionnaire had taught continuously for five years. Six percent had not 
taught, High school teachers did have a much better record of persistence 

lan did elementary school teachers More than three-fourths of the men 
Braduates were teaching five years later, but less than one-half of the 
Women were doing so. The older graduates were much more likely to 
Persist than were the younger ones. Those who did indicate satisfaction with 
their student teaching experience were more persistent than those who did 
not, A general summary would suggest that, in terms of the data found for 
the Predictor variables studied, male students and women of 30 or more 


à 
Years of age tended to make a career of teaching. 


Pe s 
TSOnality Characteristics 
at Minnesota Multiphasic Personality Inve 
si, lity traits of tea T 3 
me assista E a n (1961) found that teachers con- 
i nc lection. Flanag 
adered to be x ees “did rank higher on the MMPI scales of hypochon- 
Tlasis, Cac d and the ma sculinity-femininity 
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scale than did teachers not considered superior. Fielstra (1963) reported 
that the personal qualities of forcefulness, emotional balance, and whole- 
some philosophy of life were the only ones rated in student teaching which 
were discriminating between excellent and good first-year secondary school 
teachers. 

Since the publication of the MTAI, there has been a considerable volume 
of research with this instrument upon attitudes of teachers and teacher 
education students. The MTAJ measures only a limited number of teacher 
attitudes. There is a need for new instruments to furnish information about 
other attitudes. 

Employing the MTAI as well as two scales concerning educational values, 
MacDonald and Doll (1961) obtained evidence that older, part-time 
students did possess more mature attitudes about teaching than did 
younger, full-time students. Darrow (1962) found the MTAI to be 
sufficiently related to the standing on a teacher rating scale that he recom- 
mended the use of the MT ALI in selection of students in teacher education. 

Vertein (1961) used the MTAJ to compare one sample of students 
planning to teach with a second sample which he called an “academic 
population” and found that the former group excelled the academic grouP 
by a significant amount. Scott and Brinkley (1960) studied the improve 
ment made in the MTAI scores by student candidates during a semester 9 
practice teaching. Those whose initial scores were inferior to those of the 
supervising teachers improved significantly, whereas those whose initial 
scores excelled those of the supervising teacher failed to make statistically 
significant improvement. They concluded that these results indicate a lac* 
of either predictive or concurrent validity in the MT AI. 

Popham and Trimble (1960) also checked upon the validity of the 
MTAI in their research to ascertain whether it could differentiate between 
groups of teachers rated successful or unsuccessful by administrative 
superiors. They stated that, since the inventory did differentiate betweer 
these groups at the .01 level, it should have considerable validity in 
selecting teacher education candidates. 

Sheldon, Coale, and Copple (1959) assessed the degree of difference in 
certain personality traits of students aspiring to be teachers who ranke 

high or low in warmth or friendliness on prediction scales. They found that 
those who were high in these characteristics as compared with those who 
were low not only were significantly higher in intelligence and lower 1n 
authoritarianism but also expressed a significantly higher need for affilia- 
tion and a lower need for succorance. The EPPS and the Thematic Apper- 
ception Test (TAT) showed conflicting results in the needs of "warm 
teachers for dominance and aggression as well as in their being character- 
ized by these two traits. These conflicting findings suggested a lack o 
construct validity in the traits measured by either one, or by both, of the 
instruments. Conceivably, the traits may be parts of a larger personality 
configuration that is in need of further analysis. 
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Lang (1960) conducted a study which was more directly concerned 
= motives of students in deciding upon teaching at the elementary or 
secondary school level than were many other related investigations. He 
concluded that nurturance or “mothering” is significant in the choice 
of elementary level teaching and that achievement is relatively less impor- 
tant. For students choosing to teach in high school, the order of importance 
of these motives is reversed. Employing the EPPS. Southworth (1962) 
made a breakdown of the entire range of elementary teacher preparation 
into upper elementary ‘and lower elementary levels and then studied the 
characteristic needs of teachers preparing for each level. He found that 
candidates preparing for the early or lower elementary level teaching were 
characterized by a greater need for abasement, affiliation, succorance, and 
nurturance; whereas the students preparing for upper elementary teaching 
revealed the need for achievement, aggression. and exhibition. 

Garrison and Scott (1961) studied the personal needs of students who 
Were preparing to teach at one or more levels within the span from 
kindergarten through high school. The students were classified into five 
teaching areas: (a) lower elementary. (b) upper elementary, (c) general 
Secondary, (d) nongeneral secondary. and (e) special education. Those 
Students planning to teach at the general and nongeneral secondary level 
Were further divided according to the subject areas in which they were 
Preparing to instruct. In this aspect of the research design, it was assumed 
that there may be differences in the personal needs measured according to 
the prospective teaching level and the subject to be taught. Although the 
analytic material was detailed and long. the following findings were 
Suggestive: (a) The general secondary women did exhibit a significantly 
nee need for achievement than did women in either p oe or 

s genera j he prospective teacher of lower 
T le cp mo p P e easter degree the need 

ry grades did manifest to & sig yg 1 [em i 
9r nurturance than did the upper elementary. genera! secon p non 
Benera] secondary, and the prospective teachers of social studies, language 


arts, and business education. 
In still another study involving 
E the difference between th 
€reotypes as indicated by nonteach 


Manifest need 
Tom the UN of the last five studies. all of which used the EPPS, it 


Might be said in summary that the manifest needs of prospective teachers 
iffered according to their proposed ieaching level and to some extent 
cording tothe expected teaching area in high school. Elementary teachers 


i N - 
: Seneral did exhibit a significantly greater need for ae ae sored 
Hm affiliation, change, and abasement than did high school teachers. 

‘8 AE eene da show a significantly greater need for 


gh School te; i rn did 
teachers ar nt nd order than did elementary school 


e EPPS. Saltz (1960) investi- 
of teachers and the 
nions of teachers’ 


use of th 
e stated needs 
ers who expressed opi 


chi E 
fa vement. aggression. exhibition. 2 
"achers. 2 
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In the Saltz (1960) study, another valuable contribution was the hy- 
pothesis that the nonteacher’s opinion of the teacher or of teachers in 
general—the stereotype—may tend to mould the asserted needs of student- 
teaching candidates toward this stereotype of the teacher. Taking a contrary 
point of view, Cook, Linden, and McKay (1961) suggested that students 
planning to become teachers possess within their personalities certain basic 
need orientations which are satisfied best through participation in teaching 
and in activities related to eduration. Although further research upon this 
issue is called for, the point made by Saltz is highly pertinent. If the stereo- 
type of the teacher in the minds of potential teachers appears to be more 
or less repulsive, then recruitment of students for the program in teacher 
training becomes more difficult. 

These few findings from the studies reviewed are sufficient to show the 
detail in which the results may be described. Although teachers in general 
may be characterized by certain descriptive terms, within the profession 
there appear to exist significant differences in traits which the teachers 
of each sex possess relative to both subject matter and grade level taught. 

In their second study involving a sample of students who chose teaching 
as a career, Garrison and Scott (1962) considered the relationship between 
(a) such background factors as age, marital status, class standing in college: 
and years one planned to engage in teaching and (b) specific psychological 
needs as measured by the EPPS. Of the 15 psychological needs, 14 varied 
according to one or more background characteristics. Of the 12 char- 
acteristics, only 2 were unrelated to any of the 15 psychological needs. 


Factor Studies 


From the results of a number of factor-analytic studies, a partial basis 
is furnished for the identification of the characteristics which may b^ 
employed to select candidates for programs in teacher education. In one 
such study, Cook, Linden, and McKay (1961) analyzed the EPPS and the 
GZTS and identified the following six factors: (a) docility, (b) de 
pendency, (c) authoritarianism, (d) compulsive conformity, (e) intro" 
version-extroversion, and (f) avoidance. Using the technique of having 
faculty members in a department of education check a list of statements 
descriptive of a good teacher, Myers (1960) isolated four factors throug”! 
a statistical analysis: (a) general acceptability, (b) readiness to learn, (c) 
social sensitivity, and (d) impulse control. 

Mitchell and Pierce-Jones (1960) carried out a facior analysis of 
on scales of the California Psychological Inventory (CPI) that had been 
administered to 258 university students. The investigation listed four major 
factors provisionally identified as (a) adjustment by social conformity: 
(b) social poise, (c) superego strength. and (d) capacity for independen! 
thought and action. The results of this study pointed to a possible A 
arrangement of some of the 18 subtests of the CPI. The authors sugges 


scores 
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k that the 18 CPI scales did not represent 18 independent personality 
dimensions. 

Perry (1962) carried out a factor analysis of the intercorrelations of 43 

ie predictor variables and of the ratings of four judges of the teaching 

performance of secondary school student teachers. The factors were 


identified as (a) intellectual ability, (b) social adjustment, (c) profes- 
sional enthusiasm, and (d) speaking ability. There were few conclusions, 
if any, which could be derived from this, group of factor studies. The 
populations employed.in the testing were dissimilar, and the instruments 
also differed. ! 

À word of caution should be given to anyone attempting to do research 
with the factor-analysis research tool—especially with personality scales. 
Before commencing a [actor analysis project, one should review the 
conditions that influence the outcomes of factor analysis. Only one source 
of error or misinterpretation will be mentioned here. Spurious correlations 
arise with the use of scores that are experimentally dependent, e.g., in 
Personality tests in which the scores are derived from overlapping keys. 

A study by Jackson and Moscovici (1963) concerning the personality 
characteristics of students identified prior to their assuming the role of the 
teacher was noteworthy because of the measuring scales employed. They 
used three paper-and-pencil tests. The subjects (a) drew a picture of a 
teacher, (b) responded to a word association test in which the words might 
e related to teaching or to the school, and (c) participated in a word 
Completion test similarly related to teaching and the school. Results of in 
research indicated that trainees at the beginning of their menn. ha 

egun to identify with the teacher role. They also revealed a type of con- 

ict or uneasiness with the authority role that they would pon "uei as 
Well as the desire to maintain a pleasant, protective, nonauthoritarian sc “a 
atmosphere, Minimizing the significance of the conflict aspects of the stu " 
=! maximizing the novelty of the een employed in the research 
endeavor, one can hope for more studies o his type. 

Highly estis: v is one general reference by — su Telus 
1963) Which is a nearly complete bibliography of ep n : 
With the personality and characteristics of the teacher. The student o 


i . V ing. 
& of this chapter especially rewarding. 


Conclusion 


j : H . n 
i i n made in the improvement o 
Although noteworthy strides have bee JL perenne © 


Tese A : : son o 
i ee designs and in the introducta A estion, ond cannot llo but 
related to problems of recruitm - Toa . 
© impressed E the lack of replication » qoe pert pes pe 
rese r ne test batteries v 
archers seem ever to use the Sa! Je, it is difficult to ascertain 
, 


Populations that could be judged comparab 
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whether any genuine direction in the findings may be detected. It would 
be helpful if large-scale cooperative studies could be undertaken in which 
attempts are made to develop a systematic body of knowledge. 
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CHAPTER III 


Pre-Service and In-Service Education of Teachers 


HAROLD E. REYNARD 
Pueresrions by both professional educators and laymen highlighted the 
current intense interest in the preparation of teachers. Unfortunately, 
indications of bold experimentation either in curriculum offerings or in 
meriad: used in teacher education could not be found. Evidence of research 
n teacher education, as reported in the literature between January 1958 
and March 1963, is abundant in relation to field experience, measures of 
teachers’ attitudes, and programs designed primarily for liberal arts 
college graduates. Little evidencë of experimentation with the total 
Program of teacher education was found for this period. 
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for preparing elementary teachers was made in the Yale-Fairfield Study, 
which was reported by Howell and others (1960). Longer than four 
academic school years, this program was an attempt to overcome two 
major problems in teacher education: too little professional laboratory 
experience and too little general education. This program. which utilized 
three summers and four school years, provided extensive field experience 
with little sacrifice to the general education program. 


Pre-Service Programs 


Research concerning the effectiveness of teacher education programs 
has been confined largely to questionnaire or opinionnaire surveys. Vari- 
ables in such domains as personality, curriculum, and methods continue 
to defy satisfactory control and measurement. This lack of control and o 
precision in measurement may explain the reluctance for experimentation 
in teacher education. Clearly defined criterion measures of the competencies 
and qualities that are needed to be an effective teacher should hasten the 
development of the satisfactory planning of programs and their evaluation. 
Fishburn (1962) found significant differences between teachers! an 
administrators’ concepts of the teacher’s role. Follow-up studies of teachers, 
except for surveys of their opinions about the programs, were conspicu- 
ously absent from the literature of the past five years. 

Of some relevance to the pre-service program has been the 
campus-wide committees concerned with teacher training programs. 
lad and Boozer (1961) pointed out that the number of campus-wide 1 
mittees representative of all faculty members who work with pre-service 
teachers has increased steadily since 1890. These writers posed several 
insightful questions concerning the need for and value of such committees 
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Elementary School Teacher Education 

To evaluate the program for the preparation of elementary teachers at the 
University of Oregon, Piper (1960) surveyed 155 graduates and 
principals of the schools in which the graduates had taught. It was ; 
termined that 42 percent of the graduates preferred both the same e 
of work in each course and the same sequence. Many of the responden : 
recommended an increase in the number of hours in science. The awe’ 
of this group who did not teach expressed more dissatisfaction with the 
preparation than those who were teaching. 

Through the study of college catalogues. Andrews and Palmer ( 
examined the programs for the preparation of elementary teachers 9 
public and private institutions of seven different types in 19 co Ta 
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courses, a A i 
n» electives. Credit hours required for general education 
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education were nearly the same for all five types of institutions. On the 
basis of his findings, he estimated that 16 to 20 percent of the total course 
work for a degree was devoted to professional education. 

Many undergraduate programs for the preparation of teachers were 
described in part five of the Official Report of the Kansas Conference of the 
National Education Association, National Commission on Teacher Educa- 
tion and Professional Standards (1959). 

Strong interest in the subject matter preparation of the secondary 
teacher was evident in the literature. In order to determine the quantity 
and type of specialized study of English language other than composition 
required for high school English teachers, Slothower (1959) sent a 
questionnaire to 395 English department heads, of whom 360 responded. 
Tt was determined that 71 percent of the institutions required one or more 
courses in the special study of the English language for teachers majoring 
in English. Of the 256 requiring special study, 77 percent required only 
one course. Prescribed work in grammar predominated most programs that 
were intended for the preparation of English teachers. 

In a short article, Mayor (1962) introduced a group of ninc reports 
concerning special projects that had been designed to improve the science 
and mathematics preparation of teachers. These projects were part of the 
Science Teaching Improvement Program of the American Association for 
the Advancement of Science supported by the Carnegie Corporation. In 
a historically oriented study, Schumaker (1959) determined that. only 
slight changes occurred between 1920 and 1958 in the requirements for 
teachers who majored in teaching mathematics at the secondary level. 
Questionnaires were sent to 314 mathematics department heads, and course 
requirements for 140 institutional programs were examincd. The data 
revealed that elementary mathematics through calculus, including soli 
geometry, differential equations, and theory of equations. represented the 
preparation required in the academic year of 1920-21. At that time, a mini- 
mum of 24 semester hours was the median requirement for major prepar? 
tion. During the 1957-58 school year, the median number of semester hours 
required was 27. The mathematics educators’ recommendations for teacher 
education courses were consistent from 1920 to 1958. 

In a study of human relations in the classroom, the North Central 
Association of Colleges and Secondary Schools, Subcommittee on Human 
Relations in the Classroom (1962) reported little concern by college a" 
university staff members and no organized effort for preparation 9 
teachers for successful human relations with students, parents, and the 
community. Questionnaire returns- from 1,108 prcfessors from severa 
disciplines representing 336 North Central institutions were examined t° 
determine concern for and pre-service practices for preparing teachers t° 
deal with children as persons, to work with parents of all kinds, and !? 
understand community conditions that affect children’s development. i 
though this survey showed that large numbers of professors valued hed 
aspect of teacher preparation highly, they placed little emphasis on : 


school 
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their course: I e 
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Suifes $06 social class were a source of more problems than were 
Ses e ving relations with minority groups. Information obtained 
se studies should be useful for program planning at both pre- 
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master’s degree in the near future. Classroom control and student motivation 
were listed most frequently as the most important problems they faced. 

In an earlier study of the extent and status of in-service education in 100 
Indiana schools, Taylor (1959) found that wealth of a school district was 
a principal factor in the amount of in-service education provided by a 
school system. The larger, wealthier city school districts provided more in- 
service education than the smaller, poorer districts. 

Designing workshops and institutes to serve best the continuing educ 
tional needs of teachers in the field confronts both public schools and 
teacher education institutions. Fowler (1960) described an NSF-supported 
summer institute in 1959 for the preparation of 45 elementary schoo 
science resource teachers. Each of the teachers had at least three years of 
classroom experience and was teaching during the year prior to the work- 
shop. Increased achievement in general science as measured by the test 
selected and administered prior to and following the program was con- 
sidered to be highly significant. 

In a survey of 28 colleges and universities, Woronoff (1958) classified 
four patterns of teacher education programs for liberal arts graduates. In 
nine universities, courses were offered to permit candidates to meet only 
minimum state certification requirements. Eleven universities required a 
series of courses or a prescribed pattern to be completed before the 
institution recommended certification. Six institutions offered special semi- 
nars to provide supervised group experiences as well as supervised student 
teaching, with much of the work applicable to a master's degree. Only tw? 
institutions provided individualized programs leading both to a master's 
degree and to recommendation for certification. 
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evision. 


education: (a) programed instruction and (b) closed circuit tel 
Little was found in the literature since 1958 concerning the use of pro 
gramed instruction in teacher education. Interest in the use of closed circuit 
television in pre-service programs has led to several experiments in 1 
employment during the 1958-63 period. Rumford (1960) experimente 
with the use of closed circuit television in the teaching of elementary 5° 100: 
methods. Two groups, one experimental group and one control group» 
of 15 pairs of students matched on several factors participated. Teacher 
developed objective tests were given to measure diflerences in attainment 
In achievement, the TV lectures were judged to be as successful as pon 
ventional instructional methods, Although it was found that .elevision € 
as effective as direct observation for entire groups. it was less a ur 
tageous for high-ability students than for students of less ability, re!a 

to measures obtained from the objective tests. Moreover, 1n terms © ai 
s and final examination scores received by students, television 
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consid 
in cri E i ioa successful as other methods used. In relation to changes 
(MTAI) scores eios by the Minnesota Teacher Attitude disgentory 
Thompson (1960) 35107 was as influential as conventional methods. 
diserva on, r l aae out an experiment with three methods of 
television asi Fa ëlassrögm observation, observation by closed circuit 
each far c. des servation by films—to determine the relative values of 
conclusions: (a) a selected student attitudes. He presented the following 
eed tes o Ed attitudes of the students toward children were not influ- 
(o dien c icant degree by any of the three methods of observation; 
television cbs necnon was preferable to that based on television, and 
in Doc inn was superior to film observation; and (c) differences 
E Ghiervsiio appeared to influence student attitudes more than the method 
raphy, re n. Other studies, which were listed in Thompson’s bibliog- 
, reported similar results concerning these three methods of teaching. 
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(c) 10.85 and 9.97 in the liberal arts colleges. Inlow also presented informa- 
tion concerning compensation to cooperating schools and teachers. selection 
and competencies of cooperating teachers, field experience provided prior 
to student teaching, and many other practices of the 38 institutions. 

Through a study of 551 questionnaire responses, Jones (1960) found 
that full-time college supervisors were responsible for an average of 18 
off-campus student teachers and that part-time supervisors were responsible 
for 15 off-campus student teachers. On the average, four student teachers 
were assigned each cooperating school. In addition..he found that college 
supervisors typically visited their student teachers three or four times 
during the teaching term. 

Price (1961) studied the influence of public school supervising teachers 
on the attitudes and practices of student teachers. His findings indicated 
that student teachers adopted many of the practices of their public school 
supervisors during the student teaching period. He noted that students 
attitudes changed in the direction of those held by their respective public 
school supervisors. McAulay (1960) observed six young teachers during 
their first year of teaching to determine which of the practices and materials 
they used were similar to those of their respective publie school student 
teaching supervisors. During their student teaching, the six young trainees 
had been under the direction of one of three supervising teachers. The in- 
vestigation concluded that the materials and techniques prepared and used 


during the period of student teaching were employed more extensively 
during the teacher's first year of teaching than w 


techniques developed in methods courses. 
Many writers in teacher education stresse 
at the preparation of public school supe 
adequate numbers of schools to furnish 
supervisors to direct professional labor 
nated the opportunity either to select su 


them with special preparation and experience. Hayes (1960) survey 

10 teacher education institutions to find what they were doing to prepare 
beginning supervising teachers, Subsequent to visiting the institutions, inter- 
viewing the faculty members, and studying the literature provided, he 
arrived at the following conclusions: (a) six institutions offered a specia 
course in the pre-service program for the preparation of supervising teach- 
ers; (b) the most prevalent methods for orienting supervising teachers 
were by means of conferences and literature; aod (c) there was no 
organized effort to have the supervising teachers become familiar with the 
teacher education program. With respect to provision for additional prep? 
ration of the experienced supervising teacher, Hayes presented the follow- 
ing information: (a) only two teacher education institutions had provide 

specific budgets or plans for financing in-service preparation programs: 
(b) group conferences and workshops commonly were used for improving 
supervision; (c) data sheets for student teachers were supplied to supe?” 
visors in all but one institution; and (d) only 5 of the 10 institutions 
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ere those materials an 


d the need for programs aimed 
rvisors. Frantic searching for 
facilities as well as for qualifiec 
atory experience nearly has elimi- 
pervisory candidates or to provide 
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and students together before the student 
Juded that little had been done at the 
ervisors and that lack of specifically 
lopment of adequate in-service programs 


attempted to bring supervisors 
began student teaching. He conc 
pre-service level to prepare sup 
assigned finances hindered the devel 
for experienced supervisors. 


The Campus Laboratory School E 


oted to the role of the campus 


Much study and thought have been dev 
laboratory school in teacher education. Inspection of the professional 


literature since 1958 indicated that use of laboratory schools for student 
teaching has been declining. However. in the opinion of several writers, 
growth of television in teacher education, increases in the amount and kinds 
of field experiences other than student teaching, and increased commitment 
to research have maintained the value of campus schools for both pre- 
service and in-service education of teachers. Despite these positive points, 
Ohles (1961) has described clearly the dilemma facing the laboratory 


school. 

Hughes (1959) not only made 
schools but also examined their roles fr l 
He visited 7 campus schools and received questionnaire } 
others. Among the several findings determined from his visits to the campus 


schools, two were particularly relevant: (a) the role of campus schools 
varied from institution to institution, and (b) the use of campus schools 
for student teaching was declining. Questionnaire responses from 29 insti- 
tutions yielded the following information: (a) 24 campus schools still were 


being used for some student teaching; (b) 19 participated in research md 
vided for observation and participation; an 


3] ; A 
experimentation; (c) 14 pro ; $ 

(d) 11 served as locales for demonstration teaching. . 

From the responses to a questionnaire which was planned to determine 

what teachers in campus laboratory schools considered the role of such 

schools to be in teacher education, Venable (1960) reached the following 

two conclusions: (a) the student teaching function should be discontinued 

in campus chooks and (b) observation and purtcipsous i uei 

to constitute the most important functions of such a rem e ) 

noted that the purpose oF function of laboratory schools, whic : i os 

not clea I ] to bea highly individual matter m terms © specia 
EE la tutions. He indi- 


; education insti 
needs ata f particular teacher i B 
and problems ot P: have recast their roles from student teaching 
è cated that many campus schools have 


to other forms of direct experience for students as well as to research. 


i His survey of 21 campus schools yielded the following data concerning 
use: (a) 21, for observation; (b) 18. for demo 


nstration; (c) 15, for 
student teaching; (d) 11. for professional services: and 


(e) 9, for research. 
The stage campus schools are in is questionable. However, there is no 
doubt that they are unde phosis. 


rgoing metamor 
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om a historical point of view. 
aire returns from 29 
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Needed Research 


Much experimentation and research, especially with techniques Die 
than the questionnaire survey, are needed in relation to all aspe n 
teacher education programs. Values ascribed to most of the compon 
parts and methods used in pre-service programs rest primarily on Saee 
able theoretical foundations. Experimental evidence is needed if a soun 
base for teacher education is tu be established. 

Continued efforts to determine experimentally the qualities and compe- 
tencies required for an individual to become an effective teacher are essen- 
tial to the improvement of pre-service programs. Most critical among the 
research needs in this field of higher education is experimentation with 
five-year comprehensive programs. Satisfactory evidence concerning the 
strengths and weaknesses of single-purpose and multipurpose institutions 
for achieving the goals in each of the three aspects of teacher education— 
general education, field of specialization, and professional education—1$ 
required to guide changes in organization and program. 
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CHAPTER IV 


Conditions of Employment and Service in 
Elementary and Secondary Schools 


GARFORD G. GORDON 


A SURVEY of the research reported during the past five years on employ- 
ment and conditions of service for teachers reveals a somewhat barren 
scene. The necessary routine of social bookkeeping has gone on steadily 
with only minor changes. Summaries of legal provisions, salary regulations, 
salaries paid, economic conditions, school district statistics, and measurable 
teacher characteristics such as age, sex, formal education, and salary 
received have been reported. The best and most comprehensive of these 
studies have been those published by the U.S. Office of Education. Many 
local surveys have been carried out by state education associations and 
other groups with special or regional interests. Most of these studies, which 
Were directed toward legislation or toward the adoption of improved 
district salary schedules, generally have been of limited value to educational 
researchers but often of major importance to those directly involved. 
When one looks for research going beyond the collection and rough 
Classification of quantifiable facts about current conditions, the picture 1s 
rather bleak. The literature mostly consists either of statements of opinions 
and ideas or of descriptions of plans and panaceas—together with unevalu- 
ated and often uncritical and highly biased descriptions of innovations. 
Few studies which are based on clearly stated hypotheses and strict defini- 


tions of conditions have been reported — m 
This chapter will not discuss all of the weaknesses in the organization 
of educational rescarch that contribute to this situation. However, one 


Probable causal factor which is clearly important to the researcher has 
been the subject—either directly or peripherally—of some of the better 
studies reported since 1958. This factor 1s the lack of a clear definition of 
the role of education and of the teacher in American society. Teaching 
conditions, which to a great extent depend upon role expectations held by 


i / learly analyzed 
and fo j t be unequivocally evaluated or c a yze 
p ade explicit. This problem is revealed in 


When these expectations are not m 
varying eae many studies. particularly in those reported by Barr 
(1961), Davis (1962). Ryans (1960). and Turner and Fattu (1960). 
Since their results have a bearing on the ecc pe of ae in all 
other ar x loyment conditions, studies of teacher status and 
eaa ndi n d first in this chapter. Staff organization 


of role A id 

expectations are cons ere c r 

and utilization of staff members are treated secondly. Other topics discussed 

are teacher load; employment, assignment, and turnover; evaluation, 
; 
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i i 5 i itions of 
retention, and dismissal; economic status and economic conditi 
i H H H 
employment; and teacher morale and job satisfaction. 


Teacher Status and Role Expectations 


Most information about teacher status and role in foreign countries 
must be inferred from more m 


general studies on ccmparative education. 
Research studies, as distinct from personal narratives and general descrip- 
tions of educational conditions, were lacking during the period of this 
Review. However, there have been studies of some of the more easily 
determined aspects of the over-all status of public school teachers. The 
National Education Association, Research Division (1962d, 1963) reported 
findings as to the age, experience, training, and salary of teachers based 
on a sampling study conducted by a technique it recently had developed. 


Mason (1961) made a final report on the survey of new teachers conducted 
by the U.S. Office of Education during 


study of men teachers and ex-teachers wh 
the Air Force in 1943 w. 


found that ex-teachers were si 


teaching were somewhat limite 
relatively productive of implic 
sented a penetrating analysis of the frustrations and of the problems 
resulting from the public's exa 

cluded that the public credited 


beyond the level of teacher competence, and indulged in wishful thinking 
about “what schools should d 


atus of the teache;, Ulich (1958) decided 
tol in the selection and in the retention of 
teachers would have great effect on teacher status, but that one of the 
to the teacher’s status was “the structure of 


) studied role perceptions of teachers in 
er professional workers. On the other hand, 
in (1962) compared the role expectations of 
> and noneducation students. They concluded 


y be responsible for a substantial proportion of 
the high dropout rate during the first year of teaching. Discrepancies in 


role perception also were Teported in studies by Rubenstein (1959) and 
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Fishburn (1962). Some insight into contradictions between stated and 
ow teacher roles was furnished by Hagstrom (1962). who found that 
elementary teachers spend a major part of their workday doing sub- 
professional tasks. j 


Staff Organization and Utilization 


as been an increase of interest in the 


nature and organization of teachers’ work. Attempts have been made at both 
the elementary and the secondary levels to determine the activities required 


in a teaching position and to reorganize them with a view toward improving 
leaching conditions and the utilization of teacher time and talents. A 
helpful summary of research in this area was made by the National Edu- 
cation Association, Research Division (1959b). The best study of the use 


of elementary teachers' time was reported in the previously cited paper 
by Hagstrom (1962). He concluded that such teachers spend about 30 
percent of their time at school doing subprofessional or clerical work. 
Stafford (1962) found that this percentage was decreased significantly 


When teacher aides were employed. 

One change designed to reduce the noninstructional load of elementary 
teachers has been the introduction of provisions for duty-free noon periods. 
In 1959, according to a report of the National Education Association, 
Research Division and the American ‘Association of School Administrators 
(1960), 25 of 33 urban districts surveyed reported employing lay personnel 
for noontime supervision of students. However, for the same year, the 
National Education Association, Research Division (1962a) found that 
only 11,9 percent of all districts over 2,500 in population had duty-free 
lunch periods for teachers. Most of the districts allowed for such periods 
Simply by not providing for supervision of pupils at noontime. 

Although the problems of utilization of teacher time and of the employ- 
ment of teaching devices and subprofessional or clerical aides are closely 
interwoven with curriculum and instruction, they also have a direct bearing 
9n the status and working conditions of teachers. Since most reports in 
this area were, however: descriptive oT argumentative, they cannot be 
considered yessartli documents. One exception was à study of opinions 
?n the use of lay readers of English papers reported by Burke (1962). 

Undoubtedly, the best publicized activity on staff utilization was con- 

ucted at the secondary level with the encouragement of the National 

Ssociation of Secondary-School Principals. Most of the experiments and 
Studies reported from this program did not constitute acceptable research. 
t "rig (1962) discussed many of the problems involved and concluded 
at there was a need to improve the standards of research being used to 


t 
est such new ideas. 
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Teacher Load 


There has been much discussion of teacher load during the period saure 
by this REVIEW. Most of it has taken the form of descriptions, reni, 
and proposals for policies. Very little of the published material is "PR 
with research. Although the data presented were purely normative, t = 
portions concerned with the determination of teacher opinion and attitudes 
threw some light on the relationship between load and morale or a ae 
load and job satisfaction. McLaughlin and Shea (1960) found that load 


was the most important source of teacher job dissatisfaction in California 
elementary schools. 


Romine (1958) described assignment practices and their relationship 


to load in Colorado high schools, Using the revised Douglass formula, he 


found no clear trends and noted little difference between small and large 
high schools, Perhaps his most 


significant finding was that beginning 
teachers had about the same ] 


oad as experienced teachers. 
Brown and Obourn (1959) reported on assignments and teaching loads 


of mathematics and science teachers in three Middle Atlantic states. This 
was the most comprehensive survey of the teac 
teachers that could be found durin 
The survey was limited to use of a 
load. Class size, 


hing load of a category of 
g the period covered by this REVIEW. 
ssignment as the primary definition of 
number of classes, and other conditions affecting the load 


were given little attention. However, to the extent that the teacher's prepara- 
tion for subject areas to Which he 


0 is assigned is a factor in load, this 
study is quite valuable. 


ciation, which has long been interested in 
o has regularly coriducted surveys, although 
these have been limited to elementary schools, In reports issued during the 
period covered by this Review, the National Education Association, Re- 
search Division (1962c) revealed that there had been some relief in 
middle-sized districts but that the over-all picture concerning class size 
had changed little since 1956, i 

The hypothesis that all teachers favor a class size smaller than the one 
ey have was partially tested in a study reported by the National Education 
Association, Research Division (1961c). It was found that, when separately 
surveyed, the majority of both elementary school teachers and principals 
agreed that a class consisting of between 90 and 24 pupils was the optimal 
size. 
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Employment, Assignment, and Turnover 


Most research on the employment of teachers reported in the last five 
years was oriented toward the problems of recruitment and turnover. The 
legal aspects of teacher employment were competently dealt with by the 
National Education Association, Research Division (19592). 

Green (1960) described case studies of a nationwide sample of school 
districts and concluded that there was an increasing use of long-range plans 
for recruitment. He found that larger districts tended to have more elaborate 
programs and to use more imaginative techniques in contrast to the small 
districts which followed traditional patterns. He also noted an increasing 
tendency to involve principals and teachers in recruitment programs. Ina 
study of school systems around New York City, Redefer (1962a) deter- 
mined that the cost of recruiting one new teacher was between $500 and 
$1,000. He concluded that recruitment procedures should focus on obtaining 
candidates qualified to fill specific needs of districts. . 

There was little systematic research reported on the assignment of 
teachers once employed. As noted previously, Romine (1958) revealed 
that beginning teachers were assigned the same load as were experienced 
teachers in Colorado high schools. Brown and Obourn (1959) found that 
in three Eastern states science and mathematics teachers were frequently 
assigned classes in subject areas other than those in which they had received 


their major preparation. 
One of the major conc 
teachers is their retention after employment. 


toward the turnover problem among beginning 
(1959) determined that, although teacher turnover was less than that of 


Social workers and nurses, the averag® rate was "i Lem cn n: 
turnover rate was greatly influenced by size oi C39 rict. The Nationa 


Ed 1 Meere. h Division (1961b) reported a median 
Horton: Axaprlafiin, Renis t for districts of 100,000 or more 


terminati 7.3 percen 
ayetageidaily, cole es a a comprehensive survey, gu fee € 
that 51 percent of first-year teachers did not eem H 3 o hing n 
Years later, even though they were generally satisfic € um ue 
Kleinman (1960) discovered a significant positive correla en el pur = 
Previous knowledge of the beginning teacher about con is = es m 
Satisfaction with his work. He concluded that prospective teac pida j^ 
e given detailed information about school and community conditions by 


districts doing recruiting. 


iduals responsible for hiring 
Several studies were directed 
teachers. Mason and Bain 


erns of those indiv 


Evaluation, Retention, and Dismissal 
The major problem reported in connection with evaluation was the lack 
of known esc measures having valid relationships to teacher effective- 
hess in producing desired changes in pupils. Moreover, many studies on 
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teacher evaluation were reported during the period covered by this Review. 
Davis (1962) analyzed the problems involved in securing uniformity in 
the assessment of teacher performance. No studies were found which dealt 
specifically with the processes or criteria involved in making decisions con- 
cerning retention or dismissal of staff. 

In a summary of more than 75 studies on the measurement and pre- 
diction of teacher effectiveness, Barr (1961) concluded that the difficulties 
posed by measurement had not 


gation, Ryans (1960) found that there were definite characteristics of 
teacher behavior 


were isolated. 
In a carefully conducted study, Worth (1961) raised considerable doubt 
ere able to evaluate teachers and 
dinal study that followed the plan 
on Teacher Effectiveness in 1952. 
also concluded that their findings questioned 
able body of research in teacher effectiveness 
me kind as a criterion, 

evaluation on the 
that a teacher mus 


eralized context. Turner and Fattu (1960) pre- 
etical discussion o 


gested a sound proposal for attacking it. Gowan (1960) 


€ conditions of their employment are those of the 
Sociation, Research Division and the research offices 
ation associations, Reports from state associations 


in most cases to larger sch 


from their coverage is small 


The general economic Status of teachers also is reviewed periodically. 
The latest report of the National Education Association, Research Division 
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(1962b) showed that there was a great variation among states and districts. 
Although the average salaries of teachers were found to be increasing 
faster than the cost of living, it was noted that less than 1 percent of all 
classroom teachers had a salary as high as the income estimated to be 
required for a family of four to live at the junior professional or executive 
evel. 

Teacher salaries are supplemented by other economic provisions of 
employment. Among these are retirement benefits, the legal aspects of 
which Punke (1962) considered in an eXhaustive study. A study by the 
National Education “Association, Research Division (1961a) showed that 
many districts were paying part of the premium for insurance programs 
covering school employees. However, Dingman (1961) found that other 


types of monetary benefits were rare. 


Although much was written on merit rating 
this Review, little had research value. The best work published was that 


of Stegeman (1959) who presented a competent review of the historical 
development of merit rating and of the differences between its application 


in industry and in education. 


during the period covered by 


Teacher Morale and Job Satisfaction 


f working conditions and an indicator of the 


climate of human relations in which work is performed. Sometimes, the 
terms morale and job satisfaction are used as synonyms 1n the literature. 
There is no question that they are closely related. Perhaps it may be said 
that job satisfaction is commonly used to refer to the reactions of indi- 
viduals to specific elements in their working environment, whereas morale 
often is applied to the general level of satisfaction and enthusiasm of 


individuals and groups. 
Robinson (1959) and Robi 
marized job satisfaction research 
Some of the studies reported we 
others contained important implic 
tration. Blocker and Richardson ( ) 
Published and unpublished studies in 
needed research. They concluded that researc 
provide a means for evaluation of administrativ 
need, 


Morale is both a result o 


nson and Connors (1961, 1962) have sum- 
during the period covered by this Review. 
re concerned directly with teachers. Many 
ations for educational personnel adminis- 
1963) gave an excellent bibliography of 
this field along with a statement of 
h concerning morale might 
e behavior—a long-standing 


A number of studies attempted to identify factors influencing teacher 
morale and job satisfaction. From 2 study involving 24 school systems 
and 5,000 J I e Redefer (1959) found that a close relationship exists 

etween more and the quality of education present in iioa pa 
T 2b) ascertained that salary was a signi 
urner (1960) and Redefer (196 ) er d. nk pus 


cant rever, many other factors were : g 
ae factor. EOS (1960) found that working conditions 


importance. McLaughlin and Shea : 
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as affected rure ig load and negative pupil attitudes were a major 
job dissatisfaction. _ 
gi mens was concerned with interactions of teachers and adminis- 
trators. In a study that was an example of good research design, Willower 
(1960) determined that principals holding a relaxed and nondirective 
view of their administrative function tended to hold a higher opinion of the 
professional status of teachers than did those principals who appeared to 
be somewhat unrelaxed and directive in their orientation. Rasmussen 
(1962) noted that in some secondary situations discrepancies existed even 
when real attitudes were very similar. 

Three studies indicated that discrepancies between teacher attitudes 
and expectancies, on the one hand, and the actual conditions in the teaching 
profession, on the other, may contribute significantly to morale problems 
in the schools. Rettig and Pasamanick (1959) concluded that the actual 
status of teachers is related positively to job satisfaction but that their 
status aspirations are related negatively to it. Day (1959) found that a 
very marked change in attitudes occurred in beginning elementary teachers 
during their first six months of service. On the other hand, Rabinowitz 
and Rosenbaum (1960) reported a similar change in 


who had taught three years. In both cases, the change was in the direction 
of increased authoritarian attitudes as mea: 


1 sured by the Minnesota Teacher 
Attitude Inventory. All three studies raised a question about the realism of 
teacher training programs. 


elementary teachers 


Needed Research 


A major factor underlying 


em f listed 
by Barr (1961); Biddle TV s in some of these areas have been lis 


yman, and Rankin (1962); Blocker and 
Richardson (1963) ; and Ryans (1960). ankin ( ) ocker 
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CHAPTER V 


Teacher Personnel Research in Higher Education 
ALVIN MARKS and MARVIN N. DILLON 


Sic: 1958, the majority of articles on college teaching personnel have 


een discursive in nature. There is still a marked lack of systematic research 


o s cde) 
n problems related to teacher personnel in colleges and universities. 


Need and Preparation of Teachers 


The need for college teachers was spelled out clearly in an article in 
H igher Education (1961). The authors predicted a college student popula- 
tion in 1970 of approximately 6,000,000, of whom 4,500,000 will probably 
attend full time. They predicted that this enrollment would require an 
increase of college teachers from 232,000 during the 1959-60 academic 
year to 322,000 ten years later unless some new Ways of teaching are 


discovered and accepted. 

The common criterion for adequate preparation for college teaching was 
Possession of a doctorate degree. The National Education Association, 
Research Division (1959) reported that only one in four new college 

did not have a master’s 


teachers had an earned doctorate, that two in ten 
degree, that in general fewer and fewer new teachers had a doctorate, and 


that more and more teachers had less than a master’s degree. Maul (1959) 
Supported this point of view with evidence that the proportion of new 
teachers holding a doctorate had dropped from 31.4 percent during the 
1953.54. academic year to 23.8 percent during the 1958-59 academic year. 
He predicted that colleges and universities would continue to have fewer 
and fewer new seachera holding top academic degrees. Stoikov (1959) 
disagreed with the contention that the quality of college teachers was on 


the downgrade because fewer new teachers possessed a doctorate. Using 
as a criterion the percentage holding a doctorate 


of full-time teaching staff 

rather than the percentage of only new teachers, he presented evidence that 
the quality of college teachers had improved. His research showed that, 
Whereas 42.2 percent of full-time sta held a doctorate in 1953, the per- 
centage had increased to 49 in 1956. He not only pointed out that many 
staff members already employed proceeded to complete the doctorate but 
also indicated that a selective factor probably was present among those 


A: members who resigned. 
lpren (1962) applied a 
rough surveying all 92 of the colleges an 


f adequate preparation. 


different criterion o 
doctorate 


d universities offering 
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degrees in 1960, he determined the number and kinds of regularly scheduled 
courses that dealt primarily with the preparation of college teachers. He 
found that 41 of the institutions offered a total of 55 such courses and 
that only 23 provided one or more courses dealing with junior college 
teaching. (The reader should be aware of the fact that preparation of 
teachers for junior colleges is also conducted by institutions not offering 
doctorates. Consequently, these institutions were not included in Alpren s 
study.) He also reported in his survey that only 25 institutions provided 
a higher education program which consisted of four or more courses. 
In addition, he noted a wide diversity among these courses. Although his 
evidence showed a 72 percent increase since 1953 in the frequency of such 


courses being offered, he concluded that the broad scope of topics covered 
had not undergone much change, 


Ratings and Evaluation 


C s college environments. The CCI 
Which has been used In several studies, contains a number of items asking 
for student Perceptions of the collere staff. 


Thistlethwaite (1959) used the CCI to deter 
environmental press as viewed by N 


of Merit winners who Were attending 36 different colleges and universities. 


Scientists to those that are productive in the arts. 
Sciences. Faculties in the former as compared with 
Were viewed by the students as being more 


threatening, more permissiy, 
the former group of institut 
Continuing with his stud 
asked his respondents to ra 
they thought had contri} 


e, and more controversial than those from 
lons, 


mentioned that these inst 
in their work. 

‘Isaacson, McKeachie, 
research effort in which 
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and Milholland (1963) described a provocative 
8 group of college professors did peer ratings of 
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Am pe d em pep also rated them. Using correlation 
1 è hat on the whole the results were incon- 
clusive. However, enough consistency was found among the variables to 
indicate that a college teacher who has an agreeable nature, gives evidence 
of being stable, shows enthusiasm, and reflects high cultural attainment 
has a substantial likelihood of being regarded as an outstanding teacher 
by both his students and his colleagues. 
] In surveying all the available Ph.D. graduates from Peabody College 
since 1919, Hedges (1962) asked them to respond to the question, “What 
ofessor who exerted the most positive 
41.2 percent of the replies related in 
onship, there was the suggestion 
fluenced more by personal-social 


were the major attributes of the pr 
influence on you in college?" Since 
some way to the professor-student relati 
that respondents believed that they were in 
factors than by any specific teaching technique. 

. Veldman, Peck, and McGuire (1961) proposed an interesting way to 
investigate the personal construct system of professors. They utilized a 


sample of educational psychologists who had been asked to rate their 
The ratings of the judges 


students on a scale of over-all mental health. 

then were submitted to factor analysis to see whether individual construct 
Systems would emerge and, if so, whether they would indicate significant 
differences. Individual construct systems were found, and some significant 
differences were noted. The authors, who pointed out that the general logic 
of such a technique could be applied to any kind of judging procedure, 


Proposed calling it the “J technique. . 
Several college professors, who were found to be reluctant about being 
rated on subjectively judged factors. pointed out that such ratings were 
biased by the raters’ own attitudes. Weaver (1960) and Chansky (1958) 
found some support for this position. 
Weaver wished to examine the rel 


a student expected to receive and his rating o 
investigated whether student criticism would be 
toward his teaching skills, or toward both. 


ins 2, E . 
es dot rt iet ratings of their instructors were biased in the 
directions of the grades that the students expected to receive. More student 
m was directed toward the teaching skills of the instructors than toward 
their personali racteristics. . 

Chandy. ge ER Minnesota eacher Attitude vp Ln Fe 
Students into an authoritarian an into a democratic nr e ue aske 
these students to rate their instructors to determine Lipa heir own 
frames of reference would bias their perceptions. He p : at inei 
Taters were inclined to give democratic ratings, - kon authoritaria 
Taters were apt to perceive their instructors as € ge T 
» A limited number of new rape i appeared in fletiveness of college 

e ni : 
eigen ke i dis continued to reveal little difference 
þe attributed to any 


° ticular method. The 
m achievement that could paren - 


ationship between the grade which 
f the instructor. He also 
directed toward the 
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i igations by Churchill and John (1958) and by Marr and others 
T1960) were kis ples of such studies. Additional information yea 
aspects of teaching effectiveness may be found in the volume edite y 
e (1963). 

u^ ne Department of Health, Education, and Welfare, Office of 
Education (1960) advanced the hypothesis that class size is not the impor- 
tant variable in teaching effectiveness in higher education but that the 
quality of the teaching and learning is. This hypothesis was supported by 
12 papers which were reviewed in the publication, Similarly, the publication 
posited that no one method could be demonstrated to produce more or 
better learning than any other. This hypothesis was sustained in terms of 
the findings in eight papers. Interestingly enough, despite the latter 
hypothesis, the publication contained a further hypothesis that approaches 
to teaching that are problem-oriented improve learning, Positive evidence 
for this hypothesis was explicit in the reviews of 12 studies, 


Salaries and Other Benefits 


The National Education 
t the median salary pa 


exceeded that for women by $1,28 all the men received 
$9,390 or more, and one-fourth received $6,451 or less. 
Bokelman and D'Amico (1963) compared mean salary by academic rank 


between public and private institutions of higher learning for the 1961-62 
academic year with the following results: 


Rank Public Private 
Full Professor . tentes. $10,200 $10,200 
Associate Professor |... 7777 9,280 7,735 
Assistant Professor .. |. 77777777 6,960 6,495 
Instructor ,,... OSE Edie a ias 5,770 5,445 


Tn another study, 


Bokelman and D'Amico ( 
of selected facult 


1961) reported average salaries 
at higher salaries were received 


, 


s Professors, Committee Z (1962) 
presented the res i ial salary survey of a group of 
authors broke down their data 
universities. The survey, which 
eated faithfully every two years 
led attempt will be made here to 


wever, the over-all] salary reports 
did not vary greatly from those of Bokelman and D'Amico. Since most 
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sala m : -— z 
y Apea qud ee pene ai a 
direct comparisons. dnd in mind is agetrpiingi to make any 
dd big showed that, with respect to dollar amounts and over-all 
of ns S, id advances for this biennium compared well with those 
bile V dise ing biennium. However, when consideration was given to the 
ee progress being made toward restoring to instructional staffs their 
se economic status of pre-World Was II days, it was apparent that 
ee at these selected institutions were falling behind those of the 
it li two-year period. The study further showed that, if instructional 
z antes Were to regain their competitive position by the 1967-68 academic 
year, there would have to be an average increment of not less than 7 percent 
hat year, cumulatively measured. The 41 selected colleges and universities 
ad achieved a cumulative rate of only 5 percent in this biennium. 

The data made it quite clear that there had been a leveling between 
ranks. Possibly because of competition with others who desired young 
college graduates, the mean base salary for instructors had risen more 
rapidly than the mean base salary for other ranks. Not only the spread in 
compensation from the lowest to the highest ranks had decreased but also 
the percentage of professors receiving top salaries had declined. During the 
1939.40 academic year, there were 47 professors out of 2,511 receiving 
$10,000 or more. With adjustment for cost of living, it would take $21,500 
in 1962 to equal the 1910 figure. ]f the same proportion were to receive 
this high salary, there would need to be about 116 professors at this level; 


there were only 13. 

Nee: and Baumol (1962) prese 
e the annual self-grading compens 
Prof which was conducted by the 
Professors. The survey was quite comp 


Teena ü A 
g ways in which a professor might be a 
To summarize, the fringe benefits available to college professors were 


not high when compared with those of American business executives. The 


median institution contributed about 8 percent of salary in the form of 
ratios of 6 percent or less of average 


countable benefits. Low benefit-salary 
Salary were paid by 20 percent of the institutions. High benefit-salary 
ratios of 10 percent or more were paid by 30 percent of the institutions. 

Tn general, public institutions tended to lag behind private institutions. 
Ludlow (1959) reported that llege teachers were attracted most to 
individual colleges and univers e made available for and by 
the énicbürasamert given to research and writing. Other important criteria 
Were type d size of institution, geographic location, library facilities, 
Salary, and opportunity for promotion. Lowest in attractive value were 
academic rank Plated leave privileges. and retirement benefits. 
oy f federal money went to 25 


Orlans (1962) mentioned that 68 percer. ° 
] arts colleges 


ma ° s R 
làjor universities and that, although 186 private liberal ar 
financed programs of research, their share of 


Participated in federally 
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of information gleaned 
ation survey for the 1961-62 academic 
American Association of University 
Jete. Suggestions were made regard- 
ble to maximize his fringe benefits. 


nted a digest 


new CO 
ities by the tim 
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such funds was only 1.1 percent. This definitive report is a rich spi «i 
information about the effect of federal programs on teacher personne 
problems. 

An excellent summary of much of the pertinent data from the U.S. Office 


of Education's 1961-62 survey of faculty salaries was given in School and 
Society (1962). 


New Data Sources 


There are three sources of data that may be overlooked because of their 


recency and because of their somewhat restricted distribution. The U.S. 
Office of Health, Education, and Welfare undertook tw. 
tions: The Re 


Reporter, 


o ambitious publica- 
porter and New Dimensions in Higher Education. The 
which through April 1962 has appeared five times, consists of 
well-annotated bibliographies of all investigations reported to a clearing- 
house for studies on higher education. Similarly, only 10 issues of New 
Dimensions in Higher Education have been published. Each number was 
intended to survey in summary form a specific area of concern in higher 


education. Studies supplied to the clearinghouse provided the basic 
background data. 


The American Educational Research 
National Education Associati 
on teaching in the Handbo 
(1963). Chapter 7, “Ratin 
mers, and Chapter 23, “R 
McKeachie were particular 


Association, a department of the 
on, published a collection of research studies 
ok of Research on Teaching, edited by Gage 
g Methods in Research on Teaching,” by Rem- 
esearch on Teaching at the College Level,” by 
ly relevant to the subject under consideration. 
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CHAPTER VI 


Economic, Legal, and Social Status of Teachers 


T " i 1 
HAZEL DAVIS, MARTHA L, WARE, FRIEDA S. SHAPIRO, ELEANOR DONALD, and 
GERTRUDE N. STIEBER * 


Taree sharply contrasting types of research are reported in this chapter. 
Studies of the economic status of the teacher include primarily statistical 
surveys of practice with a limited amount of theoretical discussion. The 
basic information came largely from questionnaires and public records. 
Research on legal status was done by documentary analysis; questionnaires 
seldom were used. Many of the studies of economic and legal status of 
teachers were the work of state and national professional organizations. 

This chapter also reviews research studies concerning social status as 
well as the related topics of teachers’ attitudes, opinions, and morale. In 
contrast to published papers regarding legal and economic status, those 
pertaining to social status and related problems represent the work pri- 
marily of psychologists and sociologists who collected data chiefly through 
interviews, observations, and questionnaires. 

There are few studies considered under more than one topic. Many 
significant studies were omitted because of space limitations. 


Economic Status 


Salaries 


nomic status of teachers continued to put major 
emphasis on salary gains and losses in comparison with other occupational 
groups. For example, in an annual study, the National Education Asso: 
ciation, Research Division (1962b) presented data on trends in teachers 
salaries and compared teachers’ salaries with earnings of other professional 
occupations, Presenting data for the 1961-62 school year, the study re- 
vealed that, since 1950, salaries of the instructional staff, which includes 
classroom teachers, principals, librarians, and building supervisors, gradu- 
ally have increased. By 1961, the average salary of the instructional staff 


rst time since 1940, salaries of the instructional staff 
© earnings of the average employee in manufacturing. 


* The material presented in this chapter represents a condensation of a longer review of research 
which, for reasons of space limitations, the ei 


: bility for deletion of references or for any ch: 
introduced. 
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This comparison was between two groups, one of which consisted of 
workers, most of whom were college graduates, 
s. The other was composed of wage earners 
many of whom were 


professionally trained 
many with two or more degree: 
in manufacturing, government, and all industries, 
unskilled and semiskilled laborers and clerical or service workers with no 
professional training or responsibilities. 

The tendency to make this type of comparison has been due to the lack 
of continuing sources of information about professional salaries in general. 
A series on salaries for selected professional occupations has been released 
annually by the U.S. Department of Labor, Bureau of Labor Statistics, 
since 1961. This series covers salaries of 36 classifications of accountants, 
auditors, attorneys, chemists, and engineers working in private industry. 
The latest issue (U.S. Department of Labor, Bureau of Labor Statistics, 
1962) showed that the mean salary of Class I auditors was less than the 
estimated average annual salary of $5,719 paid to classroom teachers in 
the 1960-61 academic year (National Education Association, Research 
Division, 1961a). x 

The National Education Association, Research Division (1962c) esti- 
mated that the average salary paid classroom teachers at all levels of 
preparation and experience during the 1962-63 school year was $5,735. 


It further was estimated that 23.5 percent of all classroom teachers in 
t only 10.5 percent received $7,500 


1962-63 were paid less than $4,500, bu 
or more, Ld ein um for the entire instructional staff during the 
1962.63 school year was estimated at $5,940. . - 
In a 1960 study of the median salaries of chemists, Roethel (1 A 
found that in each of six types of positions held by men a 
achelor’s degrees, master’s degrees; and doctor's degrees, eS y 
men chemists who were teaching chemistry were earning ronn erably we 
than men chemists who were employed in research, developmen n 
esign, production and inspection. marketing and sales, or adminis ration 
and management. 
Teachers’ salaries, however, h 
eeping up relatively with increas 
Sional groups, they are rising more 
9f other branches of state and local g 
by the US, Department of Commerce, 
(1962), salaries of nonschool employees of state rere 
Increased 54.5 percent from 1951 through 1961, comp 


of 70 percent for employees in public education. 


ave been rising. Although they are not 
es in earnings of many other profes- 
rapidly than the salaries of employees 
overnments. From figures reported 
Office of Business Economics 
and local governments 
an increase 


Salaries of Women 
963) prepared a 
The U.S. D Labor (190: 
Sa rtment of La plication stated that 
Pamph], EUER en workers. The pu t 
almices 4 bre eges n mud 22 million women employed in 1962 
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were performing professional and technical work in April 1962. iun 
number was 58 percent more than that in 1950 and 87 percent more 1 han 
the estimate for 1940. However, the proportion of women professional 
workers in the total women labor force was the same in 1962 as it was in 
1940. The report also stated that the number of women professional 
workers was exceeded only by that of clerical workers, service workers, 
and operatives. The 1.3 million women teachers—excluding college instruc- 
tors—at work in April 1962 accounted for 43 percent of all professional 
women workers. ! 

In 1961, women held 36 percent of all professional positions in the 
United States. The U.S, Department of Labor, Women’s Bureau (1962) 
reported on the degree of salary discrimination against women as com- 
pared with men in the same positions and with the same training. The 


median salary for professional workers in 1960 was $4.384 for women 
and $6,848 for men. 


In view of this difference between profession 
women’s salaries, it is not surprising that, 
approximately 69 percent of all teachers, 


than earnings for other professionals (National Education Association, 


Research Division, 1963a). However, the differences between teachers' 
salaries and the earnings of other professional workers were not due en- 
tirely to sex. 


al men's and professional 
since women accounted for 
salaries of teachers were lower 


Income from Additional Employment 


5 ministrators as well as surveys concerning salary 
scheduling. 


The National Education Association, 
ported that at the beginning of the 196 
teachers in 32 states were employed un 
salary laws. Average salaries of teachers in each state are reported annually 
by the National Education Association, Research Division, a typical ex- 
ample of which was a 1962 publication (National Education Association, 
Research Division, 1962c). 
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der the provisions of minimum 
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In its most ial s Si ation: ucatio; ssocia- 
rece: y y 
nt biennial salar stud : the National Ed tion Associa: 


tion, Rese. ivisi 
iced peer d (1963d) reported on the average salary for all 
vindi dur e erms of size of school system and in terms of other 
personnel ae! a national salaries also were shown for certain other 
x ane demie principals, counselors, and superintendents. Salaries 
sul groups were reported for large public school systems 
E s research studies on salary schedules of classroom teachers 
oe ure employees were those of the National Education 
Neira i ; Research Division (1962a, 1963b). For the 1962-63 school 
Sores E ? o investigations represented a first attempt to obtain salary 
end e basis of enrollment of school systems. For both studies, reports 
compiled from data for systems with enrollments of 6,000 or more. 


In a research study for the RAND Corporation, Kershaw and McKean 
] salary schedule and 


1 1 : ^ 

=~ discussed the relationship between the typical 

acher shortages and other problems affecting education. They proposed 
credits in certain sub- 


tl : 
- salary schedules include supplements for course 
matter fields as determined by local boards of education. These addi- 


tio 1 
nal amounts would enable the schools to attract well-trained persons to 


o with severe shortages. 
topie of pay as a feature in teac 
aad ee and „study. 
Jan ji on, | esearch Division (19 
s and with reasons why many 


edules continued to be a 
One report by the National Education 
6le) dealt with the status of merit pay 
have been discontinued. 


hers’ salary sch 


Legal Status 


eee determine the legal status of public school teachers, one finds it 
"e essary to analyze constitutional and statutory provisions, school board 
as and regulations, and judicial decisions. The most comprehensive 
s y of the legal status of teac h included consideration of both 
atutory provisions and judicia as The Teacher and the Law 
National Education Associatio Division, 1959). Topics in- 
i were admission to the pro ment conditions, relation- 
ih with pupils, retirement, and 
judi e legal principles that govern the sta ¢ 
sch cial decisions received extensiv? treatment in two 
ool law textbooks—one b ilton and Mort. (1959) 
eau (1962). In addition t 
topi ples of statutes to illustrate the 

cs. 
Research Division published yearly 


TI P ANS 
i he National Education Association: Ji 
Bests of court decisions in which teachers were litigants. The most recent 
ational Education Association, Research 

ficant court cases in- 


n 
au was for the year 1962 (N ^ AMA 
ision’, 1963c). The latest annual review of signi 
401 


hers, whic’ 
] decisions. W 
n, Research 

fession, employ 


collective bargaining. 
tus of teachers as evolved through 


recently revised 
and the other 
Jein included 
h to particular 


2 /o. 4 
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volving teachers was prepared by Garber (1968). d obi 
of important state school legislation by the National Education eros 
tion, Research Division furnished up-to-date developments on enac d 
of interest and concern to teachers. Several of the significant outcomes in 
school legislation for the period from January 1 to August l, 1962, m 
cited in a research report of the National Education Association, Research 
Division (1962d). 

Remmlein and Ware (1959) used an opinion poll to evaluate the laws 
concerning protection of teacher personnel. Staffs of state education de- 
partments, officers of state school board associations, state education as- 


sociations, local school staffs, and college professors rated the laws highly. 
Additional Reference: Gauerke (1959) 


Certification, Tenure, and Autonomy 


Armstrong and Stinnett (1962) issued the latest biennial manual on 
certification requirements for school personnel as fixed by statutes and 
regulations in all the states, This report showed upgraded requirements 
for preparation; in 43 states and in the District of Columbia, a bachelor’s 


egree was mandatory for the lowest regular initial certificate for elemen- 
tary school teachers. 


Ligtenberg (1958) 
conclusions relative to 
the purpose of tenure 


surveyed tenure court decisions and arrived at two 
the past three decades: (a) The courts have held that 
legislation is to improve the school system by pro- 

n dismissal for political, partisan, capri 
alicious reasons. (b) The courts have un 
prescribed procedures in accordance with 


| expressed the view that despite the existence 
of tenure laws the right of teachers to continue in their jobs as long as 
their classroom performance is satisfactory has never been adequately 
: : ir study examined the present state and the 
emerging trends in the dismissal of tenure teachers for reasons related to 


n their two-year project, New Horizons for the 


- The report recommended that na- 
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associations create professional standards 
i force standards for the admission and 
retention of qualified persons in teaching and for the separation of the 
incompetent and unethical. The belief was expressed that this aim would 
be furthered by the legal sanction of “professional standards acts.” 
The goal of the organized teaching profession for self-discipline and 
self-determination in matters of teaching competence and professional con- 
duct was advanced when Kentucky pioneered in adopting à professional 
standards act (National Education Association, Research Division, 
1962d). A commission created by this law was given responsibility to 
develop and to administer professional practices criteria through the 


teaching profession. 
Another step in the realization of the 
unique feature in the California tenure law which recognizes (a) expert 


lestimony in dismissal actions before the courts on questions of compe- 


tency, professional conduct, and fitness to teach and (b) the professional 
standards commission of a state-wide professional teachers organization 
Williamson (1960) reported not 


as a source of such expert testimony. 

only that the procedure of expert panel d testimony proved 
sound and workable but also that its court in a teacher 
dismissal action set an important precedent. 


tional, state, and local education 
commissions to establish and to en 


above-mentioned goal was the 


investigation an 
acceptance by a 


Civil Rights 
legal scope of teachers’ freedoms as re- 


vealed by court decisions in four areas: (a) subversive political activities, 


(b) participation in political affairs, (c) membership in teachers unions 
along with the right to strike, and (d) outside activities of teachers. He 
concluded that teachers were restricted. more than most citizens in the 
exercise of freedoms guaranteed by constitutional provisions and that one 

q before entering the 


should become reconciled to these legal limitations 
teaching profession. . . 
The search of the National Educatio 


(1961c) i icti tive legal prov 
into restrictive and protec ive leg c € 
cal activities of public school teachers brought to light few specific stat- 


utory restrictions that impede teachers from participating in political 


affairs at the local, state, oT nationa vas stated that, if restric- 


] levels. It v 
lions do exist, they may be based on written or unwritten school board 
Policies or, perhaps, on the Analysis of judi- 


e of public opinion. ofj 
cial opinions indicated tha hold dismissal, 


fe not likely to up 
Suspension, er nonappointmen litically active teachers so long as 
the activity does not interfere with the teacher's efficiency, takes place out- 
Side school hours, and does not violate limiting state statutes. 

Bryson (1961) studied the legality of loyalty oath requirements of 
public school teachers. He found that prescribed methods for insuring 
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t of po 
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oyalt u g teachers havi i sonable 
yalty q i i have been considered reaso 

l l alifications amon achers SISTE 

by the courts and that the loyalty laws have not been judicially regar ded 


i ief, press. 
as violating constitutional guarantees of freedom of speech, belief, press, 
and assembly. 


Television, Radio, and Programed Instruction 


Legal problems relating to ,esidual rights, royalties, and epee 
of classroom teachers for use and reuse of instructional materials in radio 
and television broadcasting and in programed instruction were 
sidered by the National Education Association, Commission on Pro in. 
sional Rights and Responsibilities and the Division of Audiovisual In 
structional Service. The dimensions of the problems as well as "o Ep 
and guidelines to a fair policy for teachers appeared in a publication o 


proceedings of a conference (National Education Association, Division of 
Audiovisual Instructional Service, 1962). 


Teacher-Board Relations 


Formal relations between teachers org 
tion were the subject of increasing int 
Professional negotiation and collective 
support. The special memo on collectiv 
tional Education Association, Researcl 
statutes and court decisions involy 
Much of this information was bro 
professional negotiation ( 


anizations and boards of educa- 
erest during the period reviewed. 
bargaining were vying for teacher 
e bargaining prepared by the Na- 
h Division (1958) analyzed state 
ing teachers and other public employees. 
ught up to date in a later publication on 
National Education Association, Research Di- 
vision, 1962e). Besides setting forth the procedures for professional nego 
tiation and its differences from collective bargaining, the latter publication 
included an appendix containing a state-by-state digest of statutes, cour 
decisions, and opinions of attorneys general on the issue involved. 


Teacher-Pupil Relations 


Legal liability of school districts and school personnel for pupil injuries 
as well as for measures to prevent accidents in all areas of school activities 
received attention in a comprehensive publication prepared by the Ne 
tional Education Association, Research Division (1963e) for the Natia 
Commission on Safety Education. It explains the law of negligence as } 
relates to school personnel, Summarizes statutes and court decisions cor- 
cerned with school district liability, and emphasizes the responsibility 9 
the district, the administrator, and the classroom teacher for pupjl safety- 
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In stressing safety as an instrument in preventing liability for negli- 
gence, Kigin (1963) directed his study mainly to the school shop teacher. 
He stated that passage of mandatory legislation to indemnify the teacher 
against financial loss in pupil-injury cases appeared to be the most equi- 
table solution yet developed. He suggested a plan for pupil compensation 
on a state-wide basis, akin to workmen’s compensation, with the state 
education department acting as a self-insurer. 

Although the release of pupil records to parents and others has long 
been an educational issue, it has begun to emerge as a legal issue as well. 
Adams (1962) discussed the legal status of the counselor who often is 
faced with the problem of the release of records. To date, the law does 
not seem to offer teachers definite answers in this increasingly important 
field. The most comprehensive study, which included many court citations, 
was done by Bakken (1961). Further research is needed. There is a tend- 
ency for writers to confuse the legal consequences to the teacher for 
voluntarily releasing records with the question of who has the power to 
force the teacher to release records that he may not wish to relinquish. 


Retirement 


Two studies treated the subject of retirement extensively. A study by 
the National Education Association, Research Division (1961b) included 


data on 64 state and local systems to which teachers belong. It reported 
that $7.3 billion was invested in these systems and that there were more 
than 1.85 million active members of the systems, nearly 1 million of whom 
also were covered by social securily. Krislov (1962) surveyed public 
retirement systems whose members also were covered by the federal Old- 

8e, Survivors, and Disability Insurance (OASDI). He em eis ae 
retiring after January 1961 received substantially higher combined nene fits 
than those who retired before social security coverage was added. He also 
ound that coverage through OASDI resulted in improved disability x 
Survivor protection and that retirement systems have improved sub- 


Stantia i iti SDI coverage. 

he Usha oneal aor Research Division (r360, 4 
Sented a digest of 54 reported retirement court decisions. gonna pie 
Years between 1950.60 in a publication similar to others issue n 0- 
90. The cases since 1950 fell into two general categories: (a) uu 
ments with administrative determinations and (b) challenges to legista- 


10 i wr zs .ovisions in retirement statutes. 
n repealing or modifying existing prov 


—| „sheltered annuity—was 
curren i i st to teachers the tax-s 
do c ae s of the Fortieth Annual Meeting of 


a sub; . T 

ject included in the Proceeding A 1 
the National Education Association. National Council on Teacher Retire 
ment (1962). It was reported that some teachers in at least 20 states 
bius participating in tax-sheltered annuity programs. m one 
onuity Was the subject of three issues of the Retirement Income veries 
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(National Education Association, Research Division, 1961d), a series on 
retirement and social security which is published irregularly. Teachers 
were advised to seek competent advice before embarking on a tax-sheltered 
program. 

The transferability aspects of federal. state, and local government. re- 
tirement laws and policies were the subject of a report of the Advisory 
Commission on Intergovernmental Relations (1963). Although most of 
the public employee retirement systems reviewed had vesting provisions. 

j Systems with such features required 15 or more 
years of service before vesting. It was noted that these lengthy service 
requirements sometimes inhibit public employee mobility when it is in 
the public interest. The report showed that, of the 19 states providing for 
intrastate employee transferability, most were between state teachers and 
slate employees Syslems and between state teachers and local teachers 


systems. The right to purchase out-of-state public service credit was 
available in 28 state teachers retire 


that provision should be made for 


vesting after service of no more than five years in 
were among the recommendations of the Commis- 


Social Status 


Social Status is a rather nebulous characteristic of an individual or 
group; it may be determined by one or by a combination of indicators. 
Some of these indicators for teachers are economic level, educational level. 
occupation of father, and images and stereotypes of teachers and teaching 


as an occupation, Some of these indicators tend to lower teachers’ social 
status; others tend to raise it, 


Various aspects of economic status of teachers—salary, employment 
status of Spouse, ext 


Ta-teaching jo} = i iscussed. 
Clearly, the relatively eorom at were previously disc 


: : modest economic Status of teachers tends to lower 
their over-all social status, 


Educational Level 


In a synthesis of research on social mobility, Lipset and Bendix (1959) 
observed that positions of leadership and social responsibility in any 
Society usually are ranked at the top in a hierarchy of prestige, wt 
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Mason (1961) found a relationship between the education achieved by 
the beginning teacher and his social origin: the larger the community of 
origin, the better educated the teacher. He also found that in general the 
children of white collar workers were slightly better educated than were 
the children of blue collar workers. 


Occupation of Father > 


A traditional determinant of social status is father’s occupation. Mason 
(1961) reported that fathers of beginning teachers, to a slightly greater 
extent than those of all teachers, were white collar workers. The propor- 
tion of beginning teachers who were children of white collar woki m 
smaller than that of the average of all professionals. The social E 
ground of beginning teachers varied in different regions as well as in dif- 


ferent types of school districts. ae : i 

The feto Education Association, Research Division iiy pi 
reported on the occupational background of teachers ju T XE 
that teachers entering the profession in five time perio o Fa deno 
1905-14, 1915-24, 1925-34, and after 1931—tended to rellet, f PH 
Status, the changes in occupational distribution o cem postibz. Mf 
through the 50-year period. There was; however, ai larg E Jed ao 
teachers from the managerial and professional rap lebur force: as: a 
Portion from skilled and unskilled labor than in 


ws i ial mobility that 
in. ‘ndi at studies of social mobility 

Lipset and Bendix (1959) indicated that studies ©, Sk rui 

n j x r occupation o 

o ns more than compare the father's occupation d quen qe 
the respondents miss significant aspects. lare teacher's social status. 
the father’s occupation was an aq eoe sape je [jue thst fabat 
porting on a sample in the San 7 The proportion of 


j tinuum. 
ad origins at all levels of the social de cart a canam i 
teachers who were in the upper half of the soc 


y tar e i ary school 
ceded by f i lower half. Male second 

e the pro portion from th ERE Matri T Mi 
: hers den Towest in terms of social class origin. and female elementary 


School teachers were highest. 


hin 

Images and Stereotypes of Teachers and Teaching 
hers is reflected in their images 
On ial status of teac! 1 E pae 
as iam pra = reg those involved in a ng) [A md des 
and oi Farnese (1900) and Pro = Jes . eotypes of teachers. 
and other Eirr forms for characterizations or : "x ac ees 
ed reported Eitri Sue cab qe dangerous cultural 
rerved that these distorted stereotypes P ipei behind reality in 

i 3 

b Although fiction often is ahea E 
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the literature surveyed. Although Belok and Dowling found less vei 
of definite stereotypes of college professors, they noted a disproportio = 
number of undesirable characterizations and concluded that poe 
ing was in the strange position of being highly honored yet constantly 
ridiculed. : —" 
Kuhlen and Dipboye (1959) attempted to discover ways in w : 
prospective teachers differed from prospective members of certain other 
occupations. They found that public school teaching was perceived as 
offering very low potential for substantial economic gain; that those 
selecting teaching had higher needs for nurturance, deference, self-abase- 
ment, order, and affiliation than those choosing other professions; but 
that they exhibited lower needs for achievement, autonomy, and change 
than other professionally directed groups. 
In his analysis of the professional role perceptions of teachers who ex- 
pressed evaluation of their own activities, Drabick (1959) found that 
elementary school teachers generally did least and that special teachers 
did most to create an impression of an educational profession. He pro- 
posed that until teachers engaged in more out-of-school activities which 
they expressed as being professional, they would have little claim to be 


regarded as members of a profession as are doctors, lawyers, and min- 
isters. 


In what might be view. 


ed as an indirect approach to the determination 
of the stereotype of the t 


eacher held by students, Devita and Kaczkowski 
(1959) polled a random sample of high school seniors in the Chicago 
area to determine their Teasons for and against selecting teaching as à 
career. In rank order of frequency of Citation, the negative reasons were 
classified as follows: (a) interest in other careers, (b) expectation of low 
salary, (c) lack of appeal which teaching held, (d) perceived lack in 
their own ability, and (e) uninteresting nature of the work involved in 
teaching. Reasons for choosing teaching were as follows: (a) pleasure 
in working with children, (b) belief that teaching was enjoyable work: 
(c) desire to teach a Particular grade or subject, (d) enjoyment in help- 
ing others, and (e) Perception of the need for teachers. 


O'Dowd and Beardslee (1961 
teacher as seen by college 
professional role for men 


avior of public school teachers in community 
life, Buck (1960) reported 


es 


October 
1963 Economic, LEGAL, AND SOCIAL STATUS oF TEACHERS 


a workable iti i i fecti 
cntadictory expats attained their objectives despite varying and 
edic Sapna Chilcott (1961) found a rather formless teacher 
eave ce E es that the traditional teacher stereotype of 25 
ere à ^ ro en down and has not been replaced yet. He found some 
hal ume at the new stereotype may be that of an attractive and youth- 
ü narried woman who commands considerable community respect. 
He D — reviewed research on social status of teachers since 1925. 
ed nd that it was" difficult to fit teachers into widely accepted social 
us scales and that teachers might be classified best as *middle class 


professionals." 


Additional References: Charters (1963) ; Wilson and Goethals (1960). 


Morale, Opinions, and Attitudes of Teachers 


nd attitudes of teachers were of three 
Marele se v ere purely descriptive of the level of 
(b) th or of teachers attitudes toward a particular item or several items, 
ant ose which attempted to correlate morale with various casual factors, 
val 9 those which focused primarily on various methods of measuring 
Redes o morale. Since many studies involved two or more types of the em- 
ases just described, no effort has been made to classify them. 
"S. hree papers appeared that presented overviews of survey-type research 
fe cerning teacher morale. Summarizing studies made in 20 school sur- 
oe Harap (1959) found that the most common Causes of poor morale 
€ inadequate salary, large classes, poor administration, lack of a daily 
Period of relaxation, unsatisfactory plant, and lack of teaching materials 


E equipment. 
e eier (1959) reported a summary 
:000 teachers in 24 systems participated. He conclude 


D: 7 

Pe teacher morale should be concentrated on (a) E 
hx administrative relations, (b) personnel practices and policies, (c) pro- 
ision for adequate school equipment and supplies, and (d) educational 


leadership, Redefer (1963) reached 4 similar conclusion in a later review 


of 
mae than a score of doctoral studies. —— 
uhlen (1959) described career patterns in 


So : i 
ught to identify satisfactions and frustrations. 
chological needs 


ies of morale, opinions, à 
asic types: (a) those which w 


re studies in which 
d that action to im- 
board of education 


of opinionnai 


public school teaching and 
He attempted to show 
of individual teachers 


the 
relation. b 
etween the psy z s : 
and their perceptions of the ae to which those needs might be satisfied 
cell as the relation between the psychological 
eers and with 


1n th, à 
€ teaching profession, a5 W 


needs ion wi ir car 
of teachers and their gen faction with their 


Shor occupations. The importance of the relationship between the psy- 
exelogical needs of prospective teachers and a realistic perception of the 
€nt to which these needs might be met was rejected in a study by Day 
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(1959). He found that college students preparing for cen, deep E 
unrealistic attitudes toward children and school work prior to Yr qi 
and that these attitudes were modified only slightly upon completion o 
the internship. 

LaBue (1959) concluded that teacher attitudes. as measured by tlie 
Minnesota Teacher Attitude Inventory (MTAI) were related significantly 
to knowledge of the principles of educational psycliology, child develop- 
ment, and child behavior, but were not related significantly to objectivity, 
friendliness, emotional stability, personal relations, and other traits as 
measured by the Guilford-Zimmerman Temperament Survey. 

Chandler (1959) and Mathis (1959) reported on the relation between 
types of salary schedules (merit versus nonmerit) and teacher morale; 
they concluded that, although morale appeared to be a general function 
of a multitude of interrelated variables, the type of pay plan in a school 
System was not a significant variable by itself in determining the morale 
level, 

Two investigations were concerned with attitudes of teachers early in 
their careers, Making use of the MTAI, Rabinowitz and Rosenbaum 


fying to both teachers and pupils. The decline in scores, which was asso- 
ciated with geographical location, was greatest among teachers in New 
York City, However, the decline was not associated with the difficulty of 
the school as shown by mean IQ or economic status of pupils. Compared 
with initial Scores, the data scored indicated less concern with pupil 
freedom and more concern with classroom order and academic standards. 

Mason (1961) found a generally high level of satisfaction with teaching 
among beginning teachers, who were most contented with their social 
» Superiors, students, and parents, and least 


à particularly when their salaries 
were compared with earni g 


the same level of education, 
The National Educa 


ported on the Percepts of teachers who would certainly or probably 


choose or not choose teaching again. As was noted by Mason (1961) in 
this investigation revealed that more 
chers than men and secondary school 
h teaching, 

t would depict a school’s organizational 
asked elementary school teachers and 
or described in each item of an Organi- 
stionnaire characterized their schools. 


climate, Halpin and Croft (1962) 
principals to what extent the behavi 
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Reporti in N 
"n abe on faculty morale in Negro colleges, Thompson (1960) found 
that, while some factors reflected the unique position of the Negro colleges 
in American culture, a significant number of findings were in agreement 
with those of other investigations of teacher morale. 


Summary of Research on Teacher Morale, 
Opinions, and Attitudes 


A review of the research between 1958-63 pointed to the following 


major conclusions: 

l. Morale is a general function of a multitude of interrelated variables 
and dimensions rather than a function of one or more isolated variables. 
However, the absence of adequate instruments that are anchored to a 
comprehensive theoretical conceptualization of morale limits the extent 


to which research can be done effectively. . 
2. The immediate supervisor or administrator is extremely important 


to a teacher's morale. Democratic administration can offset the effects of 


other factors that tend to produce low morale. . 
3. Congruity or lack of congruity of perceptions and expectations of 
School boards and teachers is extremely important to teacher morale. | 
4. Administrators and teachers frequently have a very different view 
of the level of morale and of what is important to teachers’ morale. The 
&reater the discrepancy between their expectations, the lower the se 
5. Preparation programs for teachers are inadequate to the extent that 


they either develop or fail to change unrealistic attitudes about — 
a circumstance that may result in disorientation for beginning teachers 


as well as i d i 
in reduced morale. : 
an he relation of morale to teacher per- 


9rmance and to personnel policies and practices. 
and Rempel (1963); Blocker and 


Additional References: Bentley 
Richardson A Thorndike and Hagen (1961). 
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CHAPTER VII 


Assessment of Teacher Behavior and 
Instruction 


i DAVID G. RYANS 
o 


There HAS BEEN a large quantity of research reports dealing with 
teacher behavior during the five-year interval since the topic was last cov- 
ered in the Review. A wide range of variables has been investigated. An 
overview of the reports suggests that investigations have been piecemeal 
and fragmentary for the most part; only occasional attempts have been 
made to conduct coordinated programmatic research or to relate studies 


to some theoretical model. 


Overview of Trends in Research on Teacher Behavior 


ds are in evidence: 


general tren 
ears to have improved in quality 
ave changed since 1952, 


f£ the REVIEW, “Teachers 


In the reviewer’s opinion, several 


. l. Research on teacher behavior apP' 
in recent years. The state of the art seems to h 
When, in introductory remarks to the June issue © : 
and Nonacademic Personnel," the present reviewer deplored the scarcity 
of well-designed research and the relative dearth of significant investiga- 
lions, Alihoagh 10 years later poorly designed research continues to be 


reported, a larger proportion of the output seems to meet reasonable stand- 
* e studies deal with problems of basic impor- 


ards. Moreover, many of th ith: pro. 
lance, Assuming the validity of the reviewer s opinion that a change for 
the better has taken place, one is tempted to surmise the influences involved. 
It would be difficult to assess the reasons with any assurance; nevertheless, 
on that generally more sophisticated researchers 
are being granted advanced deg that there 1s improved understanding 
of research design and methodology: and that more efforts are being ex- 
tended toward relating empirical results to organized theoretical positions. 

ndoubtedly, one influence for improved research has issued from federal 
Support through the Cooperative Research Program and the National De- 


fense Education Act Title VII Program of the U.S. Office of Education. - 
2. A second trend which is undoubtedly related to the improvement in 
we ema of eno has been an T — given ^ = 

9dologica] pr “terion problem is no longer neg u 
gical problems. The criterion E d data collection 


as it w ds for 
as ; methods bor: 2 
a few years 48> and relatively sophisticated design and 


a = . 
ES been refined and improved; d 
alysis techniques increasingly have been employe" 
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3. A third general trend has been a lessening of attention lo the = 
teacher effectiveness, which often has been elusive in its ramifications be- 
cause of the incomparability of evaluative criteria. Instead, one may note 
an increase in attention to the fundamental problem of the description of 
teacher behavior and to the conditions with which various teacher behaviors 
are correlated. One serious impediment to the understanding of teacher 
behavior has been the frequent failure of researchers to recognize that 
before progress can be made in any field of science there must be a long 
period of investigation devoted to (a) the identification of the elements or 
components that make up the phenomena under consideration and (b) the 
study of their interrelationships—the descriptive phase of research. For 
many years, there appeared to be little recognition that study and assess- 
ment of teacher effectiveness involved the adoption of a value system and 
that value judgments necessarily were relative in nature. The estimation of 
teacher effectiveness must be preceded by (a) description of teacher be- 
haviors, (b) subjective judgment and choice of values and associated 


à particular group as manifesta- 
ell-designed study of the relation- 
teristics and teacher behaviors on 
criteria on the other. The current 
g of teacher behavior rather than to 


antity of research has necessitated 
: ming the role more of reporter than of critic. Only the 
studies that have appeared to be at least fairly adequate are reviewed: 


undoubtedly, some well-qualified ones have been omitted. It also is rec- 
ognized that each rese: 


wy cc! research investigation conducted probably is subject to 
some valid criticism, 


ea The reviewer, nevertheless, chose not to resort to 
petty criticisms. In fact, 


t ^ critique of individual studies will be largely 
omitted from the review. 


General Ref. erences 


recently or are scheduled for publication within 
reader should be aware of these. e 
e Encyclopedia of Educational Research c 
articles by Mitzel (1960) ang Ryans (1960b) devoted to the topic 9 
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Irach effectiveness. Mitzel described important attributes of criteria of 
eacher effectiveness as they must be considered by the researcher and 


discussed the problem from three aspects: (a) product criteria, (b) process 
Ryans also emphasized how important 


conducts research on the prediction 
d the current status of prediction re- 


search, His presentation was organized about topics that were concerned 
with the determination of estimates of the criterion, the choice of estimates 
of predictors, approeches to predictor-criterion relationships, tentative 
generalizations regarding the predictability of teacher effectiveness, and 
probable correlates of teacher effectiveness. 

A second major contribution, which brought together massive amounts 
of information about education, was the Handbook of Research on Teach- 
ing, edited by Gage (1963a). This important volume consisted of four 
major sections of from three to nine chapters each. Part I was concerned 
With a theoretical orientation; Part II, with methodologies in research on 
teaching; Part III, with major variables and areas of research on teaching; 
and Part IV, with research on teaching various grade levels and subject 


matters. The volume provided an excellent bibliographical source. Of par- 
ticular relevance to the study of teacher behavior were chapters by Charters 
ackson (1963), Medley and Mitzel 


(1963), Gage (1963b), Getzels and J 

(1963), Stern (1963), and Wallen and Travers (1963). 

" The American Council on Education published a relatively comprehen- 

Sive volume, Characteristics of Teachers (Ryans, 1960a). In this book, 
yans brought together many of the results of a coordinated program of 

research which involved more than 6,000 teachers, about 1,700 schools, 

and approximately 450 school systems. The staff of the Teacher Character- 

istics Study conducted approximately 100 research projects—many con- 


Sisting of multiple research studies. . . 
epresentative of the over-all effort were the following projects and 
topics discussed in the volume: a) development and analysis of a Class- 
room Observation Record and accompanying Glossary; (b) study of the 
reliability of observations of the classroom behavior of teachers; (c) lon- 


situdinal investigations of observed teacher behavior; (d) factor analyses 
of observed classroom behavior of elementary and secondary teachers; (e) 
Statistical analyses of educational points of view held by teachers: (f) de- 
velopment of and subsequent factor analyses of scales estimating teachers 
attitudes toward groups contacted in the schools; (g) investigation of 
critical incidents in teaching; (h) deve her Characteristics 


lopment of a Teac 
chedule; and (i) response analyses leading to Schedule scoring keys. 
These keys were used to obtain estimates © 


f the following: (a) character- 
istics of the classroom behavior of teachers—e.g., warm, understanding, 
i g, imaginative; 


8 ^ : A itudes— 
ystematic, responsible, stimulating (b) teacher attitudes 
ard school perso 


pe and (c) presage criteria. 
the criterion problem is when one 
of teacher effectiveness. He reviewe 


Such as : nnel; (c) educational 

3 those toward pupils or tow’ ; : 
beliefs: (S) verbal "Paans and (e) emotional adjustment. For the 
eacher Characteristics Schedule. comparisons were made of the scores of 
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teachers who had been classified according to sex, age, marital status, 
length and kind of teaching experience, type of college attended, section of 
country in which they had been teaching, and size of community and of 
school in which their teaching had taken place. 

Among several of the other topics considered were the following: (a) the 
relationships between teacher behavior and pupil behavior, (b) the cri- 
terion problem in studying teacher behavior, (c) the development of a 
theory in relation to teacher behavior, (d) the study of activities of teach- 
ers during childhood and adolescence, and (e) the description of character- 
istics of teachers who had been assessed uniformly high or uniformly low 
relative to classroom behaviors. 

Many of the research studies described in the volume also appeared in 
considerable detail in journal articles which will not be cited here. The 
reader is referred to the bibliography of the over-all report (Ryans, 1963a). 

The National Education Association, Research Division (1962b, c; 1963) 
produced three bulletins which provided data based on well-designed sam- 
pling surveys. These three bulletins, respectively, were concerned with 
furnishing information upon the following items: (a) personal character- 
istics—e.g., tabulations concerned with teacher’s sex, marital status, educa- 
tion, experience, income and employment, duties, and certain attitudes to- 
ward teaching; (b) opinions about teaching-related topics—e.g., amount 
an aiaa a d aaee Teg at nin on ur ait 
eum de id anid. A isadvantages of ability grouping, advisability o 
1 parents, and value of foreign language instruction 
in the elementary schools; and (c) miscellaneous facts relative to teacher's 
poi Se type of college degrees held, years of teaching experience; 

ome, teaching load in terms of size of classes, marital status 


and number of children in family, occupation of father, and participation 
in community activities. 


Publications Containing Selected Research Papers 


" In (1961), issue of the Journal of Experimental Education, edited by 
d + approximately 75 doctoral studies were reviewed. Considera- 
Bon was given to the criteria employed, to the importance of personal 
qualities of teachers, to the problems of motivation, and to the general 


trends and patterns of teacher effectiveness that were suggested by the 
investigations. un 

The April 1963 issue of Theory into Practice, edited by Jackson (1963) 
and the Spring 1963 issue of School Review, edited by Bidwell (1963); 
were devoted to a research-oriented presentation of problems related to 


teacher behavior and instruction, Another collection of papers with a 
strong research emphasis and with significant theoretical implications was 
an authoritative symposium on “CÇ 


urrent Research on the Classroom Be- 
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havior shers icati I duca W. at 

» E of eachers and Its Implications for Teacher Edu ation,” which 

the time of the reviewer's wri ing was scheduled to appear in the Se tembe 
t t P p r 


1963 issue of the Journal oj Teacher Education. 


Theory of Instruction and Teacher Behavior 

Mores ecrire EM ly denies 
is dr edo ond p’ i ge d or heir equa y important 
wenige nd stimulation o urther research. During recent years, 

: a researchers have emphasized these points. 
e that attention be given to theory, Gage (1963b) discussed the 
his d : neoretical models and described several different ones. Some of 
veni els were taken. from fields other than education—e.8.. statistics, 
n opathology, administrative behavior, group dynamics, and small- 
on research. Other models were directed specifically toward teaching- 
"ims Me paradigms concerned with criteria of effectiveness and with 
he teaching process. À strong case was made for a theory of teaching. 


Percepti g i: s 
erception-Diagnosis Theory of Teaching 


hing which proposes 
Is behavior leads to a teacher's 
readiness, knowledge, and the 
inferences from the behavior 
cher takes action. At the same 
or leads to a diagnosis 


a concept of teac 


i 0. Smith (1960) developed 1 
i. à teacher's perception of a pup! 
ur aas of the pupil's state of interest, 
ike. The diagnosis is based on the teacher's 
of the pupil. In light of the diagnosis; the tea 


ti ET T 
ime, a pupil's perception of the teacher's behavi 


9n the part of the pupil of the teacher's state of interest—a diagnosis 
inferred from the teacher's behavior- Thus, the pupil reacts to the actions 
of the teacher. A cycling process arises that ultimately leads to pupil 
achievement if the goals of teaching are realized. 


m of Social Action 


T š 
caching Conceptualized as 4 Syste 


resented a concept of teaching as a system 
agent, an end-in-view, à situation, and two 
t s not in a position to control one 


Smith and Meux (1962) p 


of : 
s Pise : 
Ocial action involving a? 


The agent 1 


Set : dip 
se S of factors in the situation. a d. H 
t of factors—e.g., size of classroom, characteristics of pupl5. owever, 
odi et of factors with respect to the end-in- 
odifiable conditions, 


the 

eye can modify the other set 9' à 
Wat eg. assignments, ways of asking questions. : t 
ich constitute the means by which ends-in-view are reached, consist of 


€ lypes: (a) one concerning the subject matter and instructional ma- 
erning ways of manipulating the subject 


erials a 

nd (b) the other conc 

Matter, and materials. Moreover: the manipulation of the procedural fea- 
1 (a) large-scale maneuvers 


tur E M 
es of teaching also has tw? major aspects: 
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called strategies—referring to patterns of acts that serve to attain certain 
outcomes, such as insuring that certain learnings will be acquired in as 
brief a time as possible—and (b) smaller teacher-behavior components 
constituting tactical implements of strategies which are referred to as 
logical operations—forms which verbal behavior takes as the teacher 
shapes the subject matter in the course of instruction. Thus, logical opera- 
tions are basic to the teaching process. A system was-developed by Smith 


and Meux for classifying verhal teaching behavior into pedagogically 
significant units. 


Information System Model 


Ryans (19602) extended a theoretical model first introduced in 1955-56 
which emphasized the instrumental nature of teacher behavior, the effects 
of pupil interaction and feedback on teacher behavior, the aggregation of 
more elementary characteristics of teachers and the organization of these 
characteristics into molar teacher behaviors, the effect of situational con- 
ditions on teacher behavior at different levels of aggregation, and the 


systematic nature of teacher behavior. Subsequently, Ryans (19632) elabo- 
rated upon this point of view. He sugge 
may be viewed as a communication 
a theoretical model w 
system. 


In the information system model, it is proposed that teacher informa- 
tion processing, which culminates in teacher behavior, is influenced by two 
major sets of inputs: (a) capabilities and characteristics of the teacher, or 
internal inputs—e.g., physical-physiological characteristics, general capa- 
bilities, characteristic abilities-capacities, characteristic behaving styles; 
characteristic affective sets, and retrievable information-behavior—and 
(b) conditions external to the teacher, or external inputs—e.g., objectives- 
goals of learning in a given situation, behavior content to be learned by 
the pupil, learning media available, pupil behavior, administrative policies, 
counseling to which the pupil has been exposed, and the culture of which 
the pupil is a part. Teacher information processing itself is held to consist 
of a series of logical phases tentatively classified as (a) input processing 
—sensing, filtering, analyzing and classifying, and temporary storing © 
inputs; (b) processing of information preliminary to decision making 
and channeling—goal defining and task analyzing, retrieving appropriate 
information-behavior, redintegrating information, and evaluating out 
comes of possible alternate behaviors; (c) decision making—decisions 
concerning content of information to be conveyed, selection of mode of 
teacher behavior or channeling, and control of external conditions; (4) 
processing of information for use, or programming of teacher behavior; 
and (e) execution of the selected teacher behavior. Feedback occurs both 
from the teacher's behavior and from the pupil's behavior in a particular 
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teaching situation. Feedback is hypothesized to influence teacher informa- 
tion processing and teacher behavior in future similar situations. 

. Ryans’ information system theory maintains that in activities involved 
in the formal teaching-learning situation selected information in the form 
of facts, concepts, and rules relating to cognitive, affective, and psycho- 
motor behavior is assembled, organized, and programed for conveyance 
via some medium—e.g., the teacher himself, a textbook, a TV display, or a 
programed lesson—to a receiver-learner. [he manner in which this con- 
veyance takes place is intended to enhance both accurate receipt of the 


information-behavior communicated and incorporation of the information 


into the learners hierarchical behavior repertoire 2n incorporation 
achieved by invoking active participation on the part of the receiver- 
learner. Communication in this context refers to the forwarding of mean- 
ingful information and, furthermore, to the conveyance of information 


that can be used by the receiver-learner. It involves the transmittal not 
only of signals comprising a message in the syntactic sense but also of 
references to the objects and events to which the employed symbols refer. 
Furthermore, it is dependent upon common past experiences of sender 


and receiver, on behavioral sets and states of readiness, and on current 


conditions and circumstances. 


Model Viewing the Teacher as Problem Solver 
T | Fa ¢ spoused a theoretical position in which the 
Bae wed Ha DS. ely blem solver. The position taken 


teacher i i „marily as a pro 
s conceptualized primarily € 
^ : behavior consists of responses that 


is as follows: (a) teacher class A j 

3 T : f pupil behavior 
are i 4 in bringing about a n the form o 

nstrumental in bringing 5 ; (b) when the 


held to b i icular teacher or group; 

e desirable by a particu o : i 
leacher sets his goal of bringing about a particular pupil behavior, he 
le if—that of getting the pupil to make the 


Bener a imse 
E E Loa om certain the pupil can produce de: pe am 
Mes saiia i A eg de ocesihily their 
problems i rari eaching situ EREMI” 
bomossenial pie ee responses to new or we situations; (d) 
skill in solving teaching problems depends " abi 2 dbi rh d 
Mental responses and on the ability to utilize in ormation in vy 
the appropriate instrumental response in a given ns mins ability 
n a cli a jy during* : 

A a pi ime d and (f) solving pen pog un 
mp ian constructed idi ve if i» artem Dunn 
ems that appear to have high face validity, that a wq aoe or 
Donteachers, that are solved more successfully J^ s Bd qum alk 
Supervisory teachers than by randomly selected teachers, 


mately can be shown to result in desired pupil behavior. 


hers who are goo 
ations can trans 
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It may be noted that the points of view suggested by Ryans. Smith and 
Meux, and Turner and Fattu bear a resemblance to one another, at least 
to the extent that teacher behavior is assumed to consist of instrumental 
responses—those responses or behaviors having a logical. problem-solving. 
decision-making basis. 


Center for the Construction of Theory in Education 
i 


A Center for the Construction of Theory in Education has been estab- 
lished at the Ohio State University. A number of papers dealing with the 
nature of theory and of models have been issued from this theory-oriented 
group. Descriptions of theoretical models for education have been devel- 
oped by analogy and comparison with principles derived from different 
areas, such as agricultural economics, pharmacology and pharmacogenetics, 


and general systems theory. A recent paper by Maccia and Maccia (1962) 
was a representative one from the Center. 


Methodology 


Criterion Problem 


Many research workers have discussed the criterion problem in studying 
teacher behavior and teacher effectiveness. Mitzel (1960) described im- 
portant attributes of criteria of teacher effectiveness and used the terms 
product, process, and presage to classif y criteria in terms of their proximity 
to educational goals. Barr (1961) reviewed criteria of teacher effectiveness 
employed in the Wisconsin Studies, including pupil gains, scores on 
instruments hypothesized to measure teacher effectiveness, self-ratings, 
ratings of teachers by other educational personnel, peer ratings, pupil 
ratings, and practice teaching marks, In addition to a discussion of the 
methodology relative to the development of predictors of teacher behavior 
and their validation, Ryans (1960a, b) considered such topics as the 
dimensionality of the criterion, sampling adequacy of the criterion, inter- 
relationships of criterion dimensions, generalizability of criterion dimen- 
sions, requirements of adequate criterion measures—e.g., validity or rele- 
vance, reliability, feasibility—and approaches to criterion measurement. 
These approaches included not only direct measurement of the ongoing 
criterion behavior but also indirect measurement—i.e., an assessment 
either obtained from a Preserved record of the behavior-ir-process OF 
derived from judgments of nontrained observers that were based on recall 
of past behavior or on evaluation of a product of the criterion behavior 
or on a quantifiable estimate of concomitants of criterion behavior. 

In keeping with their Point of view of teacher behavior as problem 
solving, Turner and Fattu (1960a) discussed various types of criteria 
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related to effective problem solving: (a) ultimate criterion, which is 
complete and uncontaminated—for which data usually are unattainable; 
(b) consistency criteria, which are concerned with whether problem solv- 
ing is compatible with information available for determining the solution 
of a problem and with whether the solution can be inferred from some 
type of empirical evidence, rationale, or theory; and (c) consensual 
criteria, which pertain to obtaining a consensus relative to what constitutes 
problem-solving behavior. A preference for the consistency criteria was 
a 


indicated. 


Classifying Teacher Behaviors 


Medley and Mitzel (1962) suggested a three-dimensional scheme for 
classifying teacher behaviors as an aid to researchers. Teacher behaviors 
were classified as follows: A—the proportion of situations in which a 
behavior is effective; B—the proportion of pupils affected by the = 
havior; and C—the proportion of teachers for whom the particu : 
behavior works effectively. The most effective teaching behaviors wouk 
be those high with respect to A, B, and C. Obviously, behaviors x on ; 
B, and C probably are not worth trying in teaching. The am elke 
studying different combinations of A-type, B-type, and C-type |] s: x 
Was discussed. In terms of defining the effectiveness of a Tem a cs 
havior, it is important to consider the total number of teachers, P pils, 


and situations for which it is effective. 


ing 
Observation and Assessment of Teaching 


Medley and Mitzel (1958b, 1963) considered the Does emer 
ment of teaching. Medley and Mitzel (1958a, 19 3 ^er scus 
reliability of the observation of teacher classroom be s ih 

Remmers (1963) described various kinds di i : the semantic dif- 
teaching research and also treated sociometric methods, 


i es. 
ferential, Q techniques, and other rating Procon Division (1962a) pre- 
The National Education Association esear 
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Pared a memorandum concerning amie roe for their use. Ryans 
m: Glossary together with a pe an edel which was used 
960a) d i her Characteris S 
; escribed the Teache 


i ] ior sets of teacher 
te three major se e 
d ta teristics Study to estima n 
eroi i erbal-semantic 
liso bohuciors 8 well as teacher attitudes, beliefs, v 


À ri ments have been 
ability, and emotional adjustment. Although other instrur 


ive. i 

a Ped, those described are repere ior "he use of his — id 

Solio 60a) provided 4 e : 1961). The major 

Interaction iB the classroom (indus E, mi edens 

Innoyan a he use 0 am 3 UM 

i jon of this system was" Jassroom behaviors. 
tically ddr: formation about spontaneous = 

o 


423 


Review or EDUCATIONAL RESEARCH Vol. XXXIII, No. 4 


Bowers, Davis, and Bowers (1962) described their Index of sie 
and Values which is intended to measure teacher attitudes toward self an 
others. Travers and others (1961) reported the use of an instrument for 
measuring teacher needs-motivation 


Interpersonal Relations in the Classroom 


At least two studies have made use of pupil accomplices to study teacher 
behavior and the effects of teacher behavior upon -other pupils. Keislar 
and McNeil (1959) employed pupils who were instructed to show pleasure 
under certain experimental conditions of individual spelling instruction 
(tutoring). In order to gain information about the effects of different 
teacher control techniques on classes and their reaction to the teacher, 
Kounin, Gump, and Ryan (1961) reviewed a number of studies in which 
students had played disruptive roles in the classroom and had evoked pre- 
determined—that is, contrived—responses from the teacher, They recom- 
mended that researchers should be allowed to enter classrooms to study at 
first hand interpersonal relations of students and teachers, 


Summary of Teacher-Student Variables 
Considered in the Literature 


To provide a ready reference to teacher-stud 
considered in the literature of the past five y 
prepared, For Convenience, the list which c 
chapter has been organized under these rubrics: (a) internal inputs to 

i ssing and behavior (physical-physiological char- 


organism, characteristic 
» characteristic teacher behaving styles, characteristic 


mation) ; (b) external inputs to teacher 
; i essing; (d) outputs of teacher 
information pr i 


à D à er behaviors; and (e) products of teacher 
behaviors—pupil behaviors, (See Appendix on page 434.) 


ent variables that have been 


Internal Inputs Contributing to Instructor Behavior 


> and their relation to teacher behavior. Exceptions 
y Ryans (1960a) and research carried out by 


Lucio and Wenger (1961) on some interesting hypotheses relating to 


autonomic functioning, 

Studies of the capacities an 
over the years. (The category, 
will not be discussed here, sin 
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IDE than inputs per se. Moreover, there is no relevant literature 
Kr or review.) Only a few studies will be mentioned, although 
B csr from test scores and college marks appeared frequently 
te se E = focused on other aspects of teacher behavior. Ryans (1960a) 
i findings on both verbal-semantic ability and academic abilities 

teachers. In a factor analysis of Q sorts of teacher classroom behavior. 
Wallen and others (1963) found *much academic activity versus little 
academic activity" as a major factor. Smith and Meux (1962) considered 
the abilities of teachers from the standpoint of logical operations that 
were observable in thé verbal behavior of the teacher as the subject matter 


was presented. 

Behaving styles of teachers ha 
ers opinion, these should be considered pri 
teristics, or source traits, rather than teacher 
also may be viewed as teacher behaviors when they occur in the classroom 
as the output or part of the output of teacher information processing and 


decision making in a particular teaching-learning situation. Although the 
behaving styles reported covered a wide range, certain styles appeared to 
have been observed with some regularity by different researchers. For 
example, a “friendly, warm, understanding, sensitive, sincere, empathic, 
Tapport-maintaining” sort of style pattern was reported by Bowers, Davis, 
and Bowers (1962), Hodgson and Horst (1959), Medley (1962), Medley 
and Mitzel (1959), Nunnally, Thistlethwaite, and Wolfe (1963), Ryans 


(1960a), and Wallen and others (1963) as well as by other investigators 
uring recent years. Similarly, a “responsible, systematic, businesslike. 


dependable, well-prepared, efficient-presentation, student-achievement- 
tern of style was reported by Heil and 


Oriented, order! canized" pat 
Washburne Ea Er Horst (1959). Medley (1962), Nunnally, 
Thistlethwaite and Wolfe (1963). Ryans (19602) . and Stern and others 
(1960). A pattern of styles involving “stimulating, imaginative, achieve- 
Ment-motivating, active. resourceful, dynamic” teacher characteristics was 
Teported by Catsonas and Kaiser 1962), Hodgson and Horst (1959), 
Medley (1962), Ryans (19602), and Wallen and others (1963). A “per- 
Sonally acceptable attractive, good appearing, verbally expressive, com- 
Municative professionally impressing” style pattern was reported by 
: (1959), Medley 


OWers, ji ; 962), Hodgson and Horst 
s, Davis, and Bowers (1 D HEAR and Wolfe (1963), Ryans 


and Mitzel (1958), Nunnally, T! 
60a), and Stern and others 1960) . 
nother type of pattern stu 3 

With a “di D s her- tarian” style versus an 

t t tive, teacher * : 

indirect, 2d cr pen nn child-centered style. Flanders (1960a, b; 
1961). Spee ere Ps (1960), and Wilk and Edson (1962) studied these 
ee As is apparent in the list 
Yles also have been noted in the resea” 
fr ective sets of teachers have been stud 
“quently occurring d int 


ve been studied extensively. In the review- 
marily as underlying charac- 
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f investigators had to do 
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with what might be called traditional versus progressive orientation or 
teacher- and academic-centered versus child-centered orientation. Christen- 
sen (1960), Cook, Linden, and McKay (1961), Kerlinger and Kaya 
(1959), Nunnally, Thistlethwaite, and Wolfe (1963), Ryans (1960a). 
and Schmid and others (1962) investigated this dimension. Attitudes 
toward pupils were studied by Hoyt and Cook (1960), Ryans (1960a), 
and many other investigators who employed the Minnesota Teacher Altitude 
Inventory. Personal adjustment was explored by Ryans (1960a, c), Soar 
(1962), and Stern and others (1960) ; introversion-extroversion, by Cook, 
Linden, and McKay (1961). Role expectations of teachers were considered 
by Biddle, Twyman, and Rankin (1962), Rosencranz and Biddle (1962), 
L. M. Smith (1960), and Stern and others (1960). 

The teacher’s retrievable information, or knowledge of subject matter 
content, general culture, and teaching techniques, was involved in reports 


by Cosgrove (1959), Cotsonas and Kaiser (1962), Ryans (1960a), and 
Smith and Meux (1962). 


External Inputs—Conditions External to the Teacher 


With respect to external inputs to teacher information processing— 
inputs that come from the environment in which the teacher operates— 
the published studies have been somewhat limited, at least insofar as the 
reported research has indicated an attempt to relate environmental condi- 
tions of the teacher to teacher behavior. Wittrock (1962) studied the effec- 
tiveness of directions provided for the teacher. Ryans (1960a) attempted 
to establish relationships between (a) certain beh 
characteristics on the one hand 
other—e.g., grade level and sub 
ing occurred, size of communi 


aving styles and teacher 
and (b) environmental conditions on the 
ject taught, size of school in which teach- 
ly, socioeconomic status of community in 
which teaching took place, cultural level of community, and methodologica 
emphasis of the school in which teaching was done. 
Nunnally, Thistlethwaite, and Wolfe (1963) identified factors involved 
in the instructional environment. These factors were divided into two sets- 
One set was concerned with the faculty; and the other, with student condi- 


tions. The latter set included intellectual drive, competition, scientific inler 
est, and pressure against scholarly activities, 


Teacher Information Processing Behavior 


Relatively few attem 
the “black box” whe 
teacher—and the ext 
system characteristics 
the teacher's cognitive s 
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ree re e T 
priate to the reaching obje tiv d bel ey E e E 
arise d pen g obj ierat and behavior content to be learned. This 
al area is one in which both attempts to conceptualize and subse- 

quent research to test the construct validity of propositions growing out 
of inferences about how the teacher processes information ae needed: 
As previously noted, Smith and Meux (1962) attempted to describe 
certain logical operations involved in teaching as the teacher generates or 
programs strategies—i.e., patterns of teacher behavior that serve to attain 
outcomes, such as insuring certain pupil learnings. As discussed in the 


Ki section on theory of instruction and teacher behavior, Ryans 
63a, b) attempted to describe teacher information processing—i.e., 
as the teacher makes decisions 


YR ie nonobservable processes involved 

a : , R ae i 

nd channels behavior to attain certain objectives—in terms of four 
ssing of information 


hypothetical phases: (a) input processing, (b) proce: 
preliminary to decision making and channeling, (c) decision making, and 


(d) processing of information output for use—i.e., programming of be- 


havi » : r 
1avior—preparatory to conveyance of information to the pupil system. 
teacher behavior in a particular 


These phases are presumed to culminate in 

Situation, or in what Smith and Meux referred to as teacher strategy 
employed in a particular teaching situation. Important to Ryans’ formula- 
tion is the concept of negative feedback, which emphasizes the effect of a 


teacher’s behavior and of a pupil’s response to that behavior on future 
teacher behaviors in similar situations. The feedback information serves 


as an additional input for future teacher information processing. 
Turner and Fattu (1960a) viewed teacher 


information processing 
essentially as problem solving—i.e- the solving of problems related to 
;s of Ryans, Smith and Meux, 


le ; A - r " 
cacher situations. The view and Turner and 
attu are in no sense antithetical; instead, they overlap considerably. 

ncerning t 


One of the few empirical studies co he influences of feedback 
9n teacher behavior was that of Gage: Runkel, and Chatterjee (1960), 
in which the effects of feedback from pupils to teachers was studied. Com- 
Parison of experimental and control groups of sixth grade teachers indi- 
Cated that, when teachers were provided with information obtained from 
their pupils regarding how the pupils described their actual teacher and 
how they deseribed their ideal teacher on 12 items of teacher behavior, 
a) teachers’ behaviors changed—as indicated by subsequent pupil descrip- 
“ons of their actual reading the direction of the pupils’ initial descrip- 
lion of their ideal teacher, and (b) teachers became increasingly accurate 


i k : x 
. Predicting their pupils’ descriptions of their teacher. 
ors—Outputs of Teacher 
jon Processing 


ied major approaches to research on the o 
er behaviors—have been through (a) the descr 
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behavior based on direct observation of acts of teachers in teaching situa- 
tions and (b) the description of teacher behavior based on inferences 
arrived at through observation of the acts and achievements of pupils in 
the teachers’ classes. The first approach describes teacher behavior in par- 
ticular teaching situations in terms of a logical classification of acts that 
can be directly observed; the second assumes that certain pupil behaviors 
and estimated achievements reflect teacher behavior that may have occurred 
in a given situation. 


D 


Description of Teacher Behavior Based on Direet Observation 


With regard to the description of directly observable outputs of teachers, 
various classifications have been suggested. In Ryans’ (1963a, b) most 
recent conceptualization of the outputs of teacher information processing; 
five major categories of teacher behaviors, or strategies, were hypothesized: 
(a) presenting teacher behaviors; (b) motivating teacher behaviors; (c) 
organizing-coordinating teacher behaviors; (d) 


evaluating teacher be- 
haviors; and (e) counseling- 


guiding teacher behaviors. No evidence i$ 
available for the validity of this conceptualization of teacher outputs- 
Furthermore, it is probable that in a given teaching situation teacher 
behavior may consist of some coml 
these five categories. 


Smith and Meux (1962) conceptualized the major outputs of the teacher 
and of the teacher's logical operations to be large-scale maneuvers which, 
as has been noted, are termed strategies. These outputs look toward the 
efficient acquisition and reinforcement of cert 

Similarly, Turner and Fattu (1960a) 
consist of acts that are instrumental in b 
state is definable in terms of desirable p 
is directed toward effecting a desired re 
making certain that the Pupil can prod 
Turner and Fattu attempted to infer the 
behavior from the extent to 
among teachers of different 


ae i in 
bination of the separate behaviors 1! 


ain learnings in pupils- 
considered teacher behavior tO 
ringing about a goal state. This 
upil behavior. Teacher behavio! 
sponse by the pupil and heir 
uce that response when require: 
level of effectiveness of a teacher 
which problem-solving tests differentiate 
: abilities and degrees of training and experience 
Heil and Washburne (1962) classified teachers as type A (turbulent a? 
impulsive), type B (self-controlling, orderly, and work-oriented), a^^ 
type C (fearful). Type B teachers were found to be higher than type ee 
type C with respect to demonstrating leadership, identifying with other? 
and helping others, exhibiting dramatic behavior, and showing methodic 
behavior. However, type B teachers were lower than the other groups a 
respect to humor, acceptance of impulses, and aggression. 


Description of Teacher Behavior Based on Inferences 


Various investigators have inferred teacher behavior from pupil a 
havior and/or pupil achievement. Medley and Mitzel (1959) supP 
498 
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evid FN 
iie : 2 support the proposition that pupil-teacher rapport indicated 
oo iig and understanding behavior of the teacher. 
an i i i ; 
Ee ie idi ump (1961) reported pupil aggression and misconduct to 
5 punitive teacher behavior. 
io mvesHsstions avti ^ F à " 
toda T "Ye Age involving increments in pupil learning pointed to a 
La nference about the nature of teacher behavior. Heil and Wash- 
s ( 1902) reported that pupils’ gains on the Stanford Achievement Test, 
papi s' positive feelings, and children's social acceptance of one another 
ggested self-controlling, orderly, and work-oriented behavior of the 


teacher. Flanders (1960b) found increased pupil learning to be greatest 
ho displayed flexible behavior—i.e., behavior 
t or dominative teacher influence in 
tion of the goal was clear 
her influence in other 


in the classes of teachers w 
that reflected predominantly direc! 
"oce situations, as when the student's percep 
nd acceptable, and predominantly indirect teac 
Situations. 
ao (1961a, b) reported differences 
b ondary school in the extent to whic 
havior reflected subpatterns of teacher behavior. 
fut ed teacher behaviors and observed pupil be 
gh in the elementary school but relatively low in second 
be one of his recent papers, Medley (1962) described pio mide 
ibn. terms of directly observable acts of the teacher and in terms of 
| ions of pupil behavior. Eight major dimensions of teacher classroom 
behavior were suggested: (a) activity of pupils—teacher behavior eliciting 
d large amount of pupil responses (b) presence—teacher behavior pro- 
ucing high classroom order and favorable impression upon observer; 
(€) initiative—teacher behavior permitting à high degree of pupil initia- 
live in determining class activities; (d) informative teaching—teacher 
'ehavior emphasizing objectives, information, clarification, and summariza- 
Hon; (e) imaginative teaching—teacher behavior stressing individualiza- 
Hon and use of interesting and original devices and materials; (f) con- 
Sideration — teacher behavior showing an awareness and consideration of 
Pupils’ problems and feelings; (g) warmth—teacher behavior exhibiting 
Supportive qualities; and (h) disapproval—teacher behavior including 


r y : 
quent reproof and criticism of pupils. 


between the elementary and 
h so-called productive pupil 
Relationships between 
haviors were relatively 
ary school classes. 


Products of Instructor Behaviors: Pupil Behaviors 
I A substantial number of studies dealt directly with pupil behaviors. 
b. Most instances, the assumption 2$ been implicit that the pupil 
= aviors were an effect or consequence of teacher behaviors. Most of the 
tudies tested hypotheses regarding the influence of variously estimated 
Sacher variables and variously defined and measured popi m " 
Tyan (196 a high correlation between the development o 
ep ES e = f chemistry students on 


i 
*rest and liking for subject matter on the part o 
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the one hand and teacher prestige on the other, as judged hy students 
ratings. Bryan also found that high school teachers can effectively use 
information gained from student-reaction feedback as a means of improv- 
ing their image with students. Working with student teachers, Wilk and 
Edson (1962) discovered that, when the learning climate was characterized 
by supportive-encouraging teacher behavior, when a variety of teaching 
materials and procedures were employed, and when frequent pupil activity 
was evident, there were fewer disruptive pupil behaviors. 


z Pupil Achievement and Learning 


Christensen (1960) studied pupil achievement and pupil affect-need in 
relation to subpatterns of teacher behavior. He reported that teacher per- 
missiveness was unrelated to pupil achievement but that teacher warmth 
was significantly related to pupil growth in vocabulary and arithmetic. As 
already noted, Flanders (1960b) found that indirect teacher behavior 
fostered increased pupil learnings but that the pupils who learned most and 
scored highest on classroom attitude scales were those exposed to flexible 
patterns of teacher influence. Heil and Washburne (1962) noted that pupil 
gains on the Stanford Achievement Test were greatest with self-controlling 
teachers, intermediate with turbulent teachers, and least with fearful 
teachers. 

Medley and Mitzel (1959) reported nonsignificant differences for reading 
growth and problem-solving skill of pupils studied in relation to teacher 
warmth-friendliness, teacher verbal emphasis, and teacher acceptance 0 
pupil autonomy. Compared with the modest degree of association obtaine 
between verbal emphasis of the teacher and pupil-teacher rapport, the extent 
of the relationship between pupil-teacher rapport and teacher warmth- 
friendliness was defined more clearly. 

Turner and others (1963) reported on a series of studies associated 
with the thesis that effective teachers are persons who are good solvers 9 
teaching problems in a wide variety of teaching situations. The results 
from one of the studies indicated that pupils taught by high-scoring teach- 
ers—good problem solvers—in grades four and tee attained significant Y 
greater achievement than pupils taught by low-scoring teachers in the 
same grades, j 

Page (1958) found a measurable effect on student effort and improY®: 
ment in learning when secondary school teachers wrote comments, believe 
by them to be “encouraging” on student papers when the papers wer? 
returned. Rosenfeld and Zander (1961) determined that the extent to 


which. students set their aspired grades was congruent with their perceive 
capacities. The observed rela (a) 


teacher disapproval when a felt 
that he could, (b) indiscrimi 
adequate performance, with 
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E working with student teachers, Wittrock (1962) found that those 
vho were told that their final course grades would depend on the amount of 
pupil gain in achievement produced actually did produce significantly 
more pupil gain than did a control group of student teachers who did not 
receive the set involving reward for pupil gain. 


Teacher Effectiveness 


Ryans (1960b) summarily reviewed the problem of the prediction of 
teacher effectiveness. However, as noted in the introductory remarks to 
this chapter, the number of studies concerning teacher effectiveness has 
diminished during the past few years. The philosophy involved will not 


be reiterated here. 
In an intensive study of groups of teachers who had been uniformly 
friendly teacher behavior, 


assessed as high or low with respect to warm- 
teacher behavior, and stimulating-imaginative 


a number of characteristics which 
mpared with the low and 


organized-businesslike 
teacher behavior, Ryans (1960a) noted 
were uniquely associated with the high as co 
with the low as compared with the high groups. 


Torrance (1960) analyzed the daily logs of a à 
teachers who were judged from their logs to be most effective and least 


effective were compared with respect to cognition, memory, convergent 
thinking, divergent thinking, and evaluation. The more effective teachers 
did tend to report more thinking activities—convergent, divergent, and 
evaluative—than did the less effective ones. On the other hand, the less 
effective teachers did cite more recognitive and memory experiences than 
did the more effective ones. 

Approaches to the merit rating of teachers have al 
of many school people. Burch and Ferguson (1961) 
cated approach employed in connection with the a 
West Hartford merit rating of teachers. 


ctivities of teachers. Those 


ttracted the attention 
described a sophisti- 
dministration of the 


College Instruction 


conducting and reviewing research on 


McKeachi leader in 2 } 
teaching "- has ner at the college and university level. In his 
chapter in the Handbook of Research on Teaching, McKeachie (1963) 
^ the area of college and university 


Considered in some detail research in à 
teaching. A bibliography of more than 200 titles was provided. 


Limitations of Research Findings 
note certain limitations 


is appropriate to ain 
] as restrictions on the 


; In a review of this nature, it e 
‘volved in the research findings reported as we : 
pplication of the findings Several will be mentioned. 

431 


Review or EDUCATIONAL RESEARCH V ol. XXXIII, No. 4 


The results of studies reviewed have pertained to identifiable patterns of 
teacher behavior and to related topics. These results reflect contemporary 
behaviors of teachers. Regardless of how valid the results may be for 
the time at which the studies were conducted, the findings for replicated 
investigations may be quite different at another time or, for that matter. 
in another cultural setting. One may generalize in time or from culture to 
culture only when conditions are approximately constant. In society, change 
often is rapid. Conditions sometimes vary widely over geographical space. 
It is important to note that what has been recorded may or may not he 
generalizable to the future or to different cultures. 

The data obtained and the findings reported relative to teacher character- 
istics and teacher behavior do not indicate what value decisions should be 
formulated. The results of these research studies not only are descriptive of 
particular samples of teachers but also are variable relative to the cultural 
milieu which provides the setting at the time and place of the investigation. 

Tt also must be recalled that one hazard involved in the utilization of 
most behavioral research findings rests in their application to individual 
cases. For a large group of teachers, considered as a group and viewed 
actuarially, a principle of teacher behavior may be stated with rather high 
probability. For the same behavior, one would he able to make such a 
statement or prediction with regard to one individual teacher with a 
probability perhaps only slightly higher than chance expectation and 
therefore with substantially reduced assurance. 

Several studies have indicated that assembled descriptions of teacher 
behavior are not equally generalizable to all kinds of teachers. Just as the 
reliability and validity of tests or measurements of a certain psychological 
construct have come to be viewed as relatively specific in nature—i.e» 
limited to the particular instrument used, in a defined class of situations: 
for a defined purpose—so also must the results of teacher characteristics 


research be considered. Although some findings seem to possess considera- 


ble generality, it would be well to keep in mind that present behaviors 
probably vary in only partially known ways from one grade level to another: 
from one subject matter to another, and from one social context to another- 

et is the reported investigations of teacher behavior and teacher 
characteristics were of a taxonomic or descriptive sort. Although they gre 
appropriate, descriptive observations per se do not provide explanations 
of teacher behavior. When interrelationships among teacher characteristic? 
or among teacher characteristics and pupil haranera are reporte 
they often are of a correlationa] natüre. Moreover, antecedent-consequent 
relationships usually cannot be inferred. Longitudinal studies should be 
conducted to help clarify conceptualizations about such matters. However: 
with respect to most teacher characteristics and behaviors and with respec! 
to the conditions that influence them, the current state of information does 
aot permit the luxury of inferences about producer-product relationships: 


432 


October 1963 AssESSMENT OF TEACHER BEHAVIOR AND INSTRUCTION 


Future Research on Teacher Behavior 


In a recent article, Ryans (1963c) suggested a limited number of possi- 
ble directions of future research. A consideration of the contents of the 
article and additional reflections by the reviewer pointed to the following 
courses of action that might be taken to give direction to future research: 
(a) taking account of shortcomings of past research; (b) analyzing areas 
of teacher behavior and the teaching-learning process: (c) evaluating 
recent trends in teacher behavior research (d) surveying the changing 
social, economic, and educational conditions; (e) reviewing related meth- 
odological problems; and (f) considering the theoretical framework of 
teacher learning. Some of the directions of future research might include 
the following: (a) extension of the identification of properties of teacher 
behavior; (b) study of the relevant aggregates or compounds of teacher 
behavior that have importance for the teaching-learning process; (c) atten- 
tion directed toward prediction which conforms closely with the “pre- 
diction research paradigm"; (d) methodological investigation oriented 
toward the development of tools and techniques for obtaining increasingly 
valid information about teacher behavior; (e) systematic inquiry aimed 
at the operational description of pupil behavior goals; (f) formulation of 
relationships between molar teacher behaviors and external or environ- 
menta] inputs, as well as between molar teacher behaviors and the internal 
inputs or teacher characteristics; (g) determination of the effects of feed- 
back to the teacher based on his previous behavior and of the results of 
feedback to the teacher on pupils; (h) exploration of how teacher behaviors 
vary and of how they relate to observable, operationally defined a 
of atypical pupils, such as the gifted, the retarded, and the handicapped; 
(i) analysis of the influence of different kinds of teacher ue hen pro- 
&rams and courses, of practice teaching and its variations, and o ists 
other educational experiences on the developing patterns of teacher c m 
acteristics; (j) ascertainment of the influence of api peo be ears 
teachers, both organized and informal, on patterns of teacher came 
tics and behaviors; (k) completion of comparative studies «| be ws 
Istics of teachers of different national politen andia tom pre a 

experimentation with the use © simulation 1 á : n see 

teacher behavi ing teacher education; (m) study of the programi 
ming of ee (n) description and analysis of <p ^ 
an organi coorta of automation and of 
a i i i i e influence oi au 

ological media; (o) specification Ld ien. quiis ali a 
nges on teacher behavior; (q) deter- 
tary teacher behaviors aggregate to form 
ment of a conceptual frame- 
arious operational condi- 
tons with the various teacher ] iyd T p bei a 
Pupil behaviors; (s) systematic theoretical and exper p 
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i ssi isi aking, output 
regarding the nature of teacher input processing, € eld n 
* i i 2 investigation o 
i ntrol; and (t) investiga D 
rocessing, and information co E dw 
jain of implementing innovations and new developments into tl a 
repertoire of behavior. 


Appendix 


Summary of teacher-student variables referred lo 
in representative reports and papers, 1958-63 


Inputs to Teacher Information Processing 


Internal Inputs—Teacher Characteristics 


tveans 1960a, 
1. Physical-physiological characteristics (Lucio and Wenger, 1961; Ryans 19 
1963a) 


2. General capabilities of human organism (Ryans, 1963a) 


3. Characteristic abilities-capacities 
a. Academic ability (Wallen and others, 1963) 
b. Verbal-semantic ability (Ryans, 19602) 


4. Characteristic teacher behaving styles 


a. Achievement-motivating (Hodgson and Horst, 1959) 
Enthusiastic (Cosgrove, 1959) 
Imaginative (Medley, 1962) 
Inspiring, stimulating (Cotsonas and Kaiser, 1962) 
Stimulating (Hodgson and Horst, 1959) 
Stimulating, imaginative (Ryans, 1960a) 
Vigorous, dynamic (Wallen and others, 1963) 
b. Avoidance (Cook, Linden, and McKay, 1961) 
Preclusive behavior (Cogan, 1958a, b) a hwaite, 
c. Businesslike, responsible, systematic (Ryans, 1960a; Nunnally, Thistlethw 
and Wolfe, 1963) 
Orderly (Stern and others, 1960) 
Organized (Hodgson and Horst, 1959) 
Planning and procedures emphasis (Cosgrove, 1659) 
Preclusive behavior (Cogan, 1958a, b) 
Presence (Medley, 1962) . 
Self-controlling, orderly, work-oriented (Heil and Washburne, 1962) 
d. Conjunctive behavior (Cogan, 1958a, b) 
e. Consideration (Medley, 1962; L. M. Smith, 1960) 
Emotional climate—warmth, friendliness (Medley and Mitzel, 1959) d Wolfe; 
Frendin understanding (Ryans, i960a; Nunnally, Thistlethwaite, an 
Pupil rapport-maintaining (Cosgrove, 1959) 
Sensitive, sincere (Hodgson and Horst, 1959) 
Sensitive, sympathetic (Cotsonas and Kaiser, 1962) 
Supportive (Bowers, Davis, and Bowers, 1962) 
Warmth (Christensen, 1960; Medley, 1962; Wallen and others, 1963) 
f. Critical (Stern and others, 1960) 
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g. Direct—teacher-centered (Flanders 1960a, b; 1961) 
Dominant (Stern and others, 1960) 
h. Disapproval (Medley, 1962) 
i. Exhibitionistic (Stern and others, 1960) 
Expressive (Ryans, 19602) 
Personally and academically impressive (Hodgson and Horst, 1959; Nunnally, 
Thistlethwaite, and Wolfe, 1963) 
Verbal emphasis (Medley and Mitzel. 1958b) 
Verbal output. (Bowers, Davis, and Bowers, 1962) 
j. Fearful (Heil and Washburne, 1962) 
k. Flexible (Flanders, 1960, b; 1961) 
l. Indirect—nondirective (Flanders, 19602, h; 1961) 
Integrative (Wilk and Edson, 1962) 
Nondirective (Stern and others, 1960) 
m. Inclusive behavior (Cogan, 19582, b) 
n. Informative (Medley, 1962) 
Initiating structure (L. M. Smith, 1960) . 62) 
Problem structuring statements (Bowers, Davis, and Bowers, 1962 
o. Turbulent, impulsive (Heil and Washburne, 1962) 


a 


- Characteristic affective sets 
à. Academic-centered (Ryan 
Traditional orientation (Kerlin 
b. Achievement-motivated (Schmid and others, 1 
Status-striving (Stern and others, 1960) 
c. Affiliation-motivated (Schmid and others, 1962) 
d. Authoritarianism (Cook, Linden, and okay, 1960) 


e. Confident (Wallen and others, 1963) 
ident (Wallen an vedas, anid Mss 1961) 


f. Compulsive conformity (Cook, L n 
g. Dependency (Cook, Linden, ang o 1961) 


Dependent (Stern and others, 
Social dependency (Soar, 1962) . ; 
h. Docility (Cook, Linden, and McKay, 1961 


i. Emotional adjustment (Ryans, 1o c) 


Personal maladjustment (Soar, 1962 
. Preadult fixated (Stern and others, 1960) |. 1961) 
j. Introversion-extroversion (Cook, Linden, ai vm : es 
k. Knowledge of pupil orientation (Schmid and others, 
l. Nurturant (Stern and others, ond 
m. Permissiveness (Christensen, 1960 | 
Dermissive—child-centere i sans, 196a 
rogressive orientation (Kerling r 
as Professional attitude (Sehmid and others, 962 
Professionalism (Soar, 1962 1059) 
Professional satisfaction (Hodgson and Horst, 195 
Gump, and Ryan, 1961) 
(Schmid and others, 1962) 


d Cook, 1960) 


ns, 1960a; Nunnally, Thistlethwaite, and Wolfe, 1963) 


ger and Kaya, 1959) 
962) 
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S- Social superficiality ( 
t. Time spent alone (Wa 


- Retrievable i i ) 
e information , Caiser, 1962) 
a. Knowledge of content and techniques (Cotsonas and Kigeg 96; 
Subject matter knowledge and organiza uen : ith End Meux. 1962) 
Subiect matter knowledge (Ryans sates Se n l 
». Knowledge of professional information (Ryans. ^ 435 
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External Inputs 


1. 
2 


Directions to teacher (Wittrock, 1962) 


i d i socioeconomic 
Grade level or subject taught, size of school, size of community, UE MES "i 
status of community, cultural level of community. and methodological emp 
school (Ryans, 19602) 


. Instructional environment (Nunnally, Thistlethwaite, and Wolfe, 1963) 


i Af ibn interes essure 
. Student conditions: intellectual drive, competition, scientific interest, press 


against scholarly activities (Nunnally. Thistlethwaite, and Wolfe, 1963) 


Teacher Information Processing 


s 4 ee A . seht naking. 
. Information processing by teacher: filtering, redintegration, decision m E 


" i Y T eclive, ane 
encoding, channeling, control of information—i.e., cognitive, affective 
psychomotor information (Ryans, 1963a. b) 


. Teacher feedback effects 


a. Effect of interaction analysis training (Flanders, 1961) 


~ > ake], and 
b. Effects of pupils’ description of own and ideal teacher (Gage, Runkel, 2 
Chatterjee, 1960) 


c. Effects of student reactions (Bryan, 1963) 
d. Theory of feedback (Ryans, 19634) 


. Teacher logical operations 


a. Classes of logical operations (Smith and Meux, 1962) 


" rücessing ^ 
b. Conditions influencing, classes of, and information flow of logical processing 
theoretical (Ryans, 1963a) 


- Teacher problem solving (Turner and Fattu. 1960) ) 


Outputs of Teacher Information 
Processing—Teacher Behaviors 


. Strate; 


" TO 4 feux. 
106] gies followed by teacher to accomplish pupil learning (Smith and Me 


2. Teacher counseling behavior (Ryans, 19632) 
3. Teacher evaluating behavior 


. Teacher organizing-coordinating behavior (Ryans, 196 
. Teacher presenting behav 


a. Fair grading (Hodgson and Horst, 1959) 
b. Teacher evaluating behavior (Ryans, 1963a) 


4. Teacher motivating behavior (Ryans, 19632) 


3a) 
ior (Ryans, 1963a) 


Teacher problem solving in teaching situations (Turner and Fattu, 1961) 


Products of Teacher Behayiors—Pupil Behaviors 


, : 958? 
. Effects on pupil effort and learning of feedback from teacher to pupil (Page. : 
. Pupil achievement 


a. Pupil achievement. (Turner and others, 1963) 
b. Pupil learning (Flanders, 1960b) 
c. Pupil gains (Christensen, 1960) 
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d. Pupil reading growth (Medley and Mitzel, 1959) 
e. Pupil test gains (Heil and Washburne, 1962) 
Pupil activity (Medley, 1962; Wilk and Edson, 1962) 
. Pupil alertness (Ryans, 1960a; 1961a, b) 

Pupil aspirations (Rosenfeld and Zander, 1961) 


on 4 w 


6. Pupil conduct 
a. Pupil conduct in class (Kounin, Gump, and Ryan, 1961) 


b. Pupil disruptive acts (Wilk and Edson, 1962) 
c. Pupil opposition (Heil and Washburne, 1952) 
. Pupil conformance (Heil and Washburne, 1962) 


. Pupil confidence (Ryans, 1960a; 1961a, b) 
{. Smith, 1960) 


so 2) =I 


. Pupil expectations and motives (L. N 
10. Pupil feelings (Heil and Washburne, 1962) 
11. Pupil initiating behavior 
a. Initiating behavior (Medley, 1962; Ryans, 19602, 1961a. b) 
b. Self-initiated activities (Cogan, 19582. b) 
tter (Bryan, 1963) 


12. Pupil interest in subject ma 
and Bowers, 1962) 


13. Pupil leadership activities (Bowers, Davis, 
14, Pupil rapport (Medley and Mitzel, 1959) 
ng skill (Medley and 
(Cogan, 19582, b) 


15. Pupil problem-solvi Mitzel, 1959) 


28: Pupil required activities 
17. Pupil responsibility (Ryans, 19602, 1961a, b) 

18. Pupil social acceptance of one another (Heil and W: 
19. Pupil striving (Heil and Washburne, 1962) 


20. Pupil waivering (Heil and Washburne, 1962) 
ass (Medley and Mitzel, 1958b) 


(Bowers. Davis, and Bowers, 1962) 
rs, Davis, and Bowers, 1962) 


ashburne. 1962) 


21. Soci A 
eh Social organization of cl 
5s "Traditional activities of pupils 

23. Verbal activities of pupils (Bowe 
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CHAPTER VIII 


Teacher Personnel: A Brief Evaluation of 
the Research Reviewed 


WILLIAM B. MICHAEL 


Ix HIS recent evaluation of research reviewed in each of the chapters of 
the June 1963 Review, Alexander posed five different questions, the answers 
to which furnished a basis for the formulation of judgments concerning 
both the worth of the research reported and the direction of future rescarch. 
Slightly modified, these same five questions may be used as standards 
against which to evaluate the research pertaining to teacher personnel: 
Which studies reviewed seem to be critically significant? What new and 
promising areas of research concerned with problems of teacher personnel 
have been opened? What significant contributions to a theoretical con- 


ies : ? 
ceptualization of problem areas in teacher personnel have been proposed? 


What new and promising methodological approaches to research concern- 
ing teacher personnel h 


ave been introduced? What lies ahead in research 
for teacher personnel? 


To a great extent, the answers to these questions depend on the manner 
in which the reader defi 


they frequently serve to answer practical ques- 
cy and to yield a basis for prediction of trends. 
urvey type are useful in determination 
ment, promotion, and retention of per- 
hool finance and tax rales; and to count- 
$ also a place in educational research for 
a scholarly attempt to analyze, to synthe- 
mportant research findings. Furthermore, 
hical and historical issues, with problems 


h methodological and statistical develop- 
ments can represent significant contributions to educational research. 


To a considerable extent, however, it appears that those research papers 
that are related to a theoretical framework constitute the ones most likely 
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to contain the findings necessary for the development of a science of educa- 
tion. If one examines recent issues of the Review and compares them with 
those of five or ten years ago, it appears that there is an increase in the 
proportion of articles cited in the bibliographies that reflect concern with 
experimental verification of hypotheses or of predictions derived from a 
theoretical structure. Such articles should be given high priority for 
inclusion in the Review. In this issue, the reader will note that except for 
Chapter VII relatively little of the research reported can be regarded as 
theoretical in its orientation—a fact which'is due probably in part to the 
nature of the topics considered. 


High Standards Needed in Selecting 
Publications for the REVIEW 


Irrespective of what types of research studies are to be described in future 
issues of the Review, it is the writer’s contention that authors should give 
greater attention than they have in the past to establishing a set of rela- 
tively rigorous standards by which they can decide whether to include a 
given article, report, or book. The rate of publication is increasing so 
rapidly that only a small fraction of the total number of articles in 
the professional literature can be cited. Probably, on the average, only 
10 percent of published papers in educational journals are worthy of being 
reported in the REVIEW. 

The writer’s bias was shared by a group of contributors to Chapter V 
of the June 1963 Review who established and applied specific standards to 
the selection of a relatively small number of references from a relatively 
large tentative bibliography. Such criteria as the relevance of the problem, 
adequacy of the experimental design, and appropriateness of statistical 
treatment had to be met. In a pointed statement that constituted their 
closing paragraph, Gibboney and others wrote: 

“This journal could perform a vital service by rigorously selecting fewer 
by analyzing each study in depth, and by candidly 
hs and weaknesses. Exclusion from the Review 
come the vehicle for rigorous internal professional 


studies for review, 
reporting its strengt 
would increasingly be 
criticism." 


Published Works of Critical Significance 


In his conscientious efforts to be highly selective, each contributor to this 
issue of the Review not only tried to include significant studies but also 
endeavored to cover each of the major problem areas within his assigned 
topic. Several of the contributors who indicated marked conflicts concern- 
ing the inclusion of certain studies of questionable quality decided that 
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they had an obligation to report a certain amount of research in cie 
topical area, even at the risk of including the most promising Saa 
among several that seemed to constitute mediocre and trivial inicia 
It may well be that certain topics should have been omitted. In the writer 5 
estimation, slightly more than 55 percent of the entries in the bibliographies 
could have been deleted without any noticeable depreciation in the value 
of the current issue. This opinion is based on a careful examination of 
the contents of approximately 87 percent of the references cited. . 
Without doubt, the most significant contribution during the past five 
years was the AERA-sponsored Handbook: of Research on Teaching, edited 
by Gage (1963a). The strong emphases on both a theoretical framework 
and research methodology, the attempts at a description of the major 
variables and areas of research on teaching, and the detailed exposition of 
research on teaching at various grade levels and in different subject matter 
areas should serve to encourage its readers to raise the level of their own 


future research endeavors that are concerned with teaching behavior, the 
instructional process, and the related 


recruitment. Careful reading of chapte 
gest numerous research problems wo 
point to improvements both 


knowledge concerning teaching 
€ a direct impact on the problems of teacher 
Te-service and in-service education of teachers. 
ent and service in elementary and secondary 
education, and the realization of improvements 
social status of teachers. " 

ribution was Ryans’ (1960) Characteristics of 
Teachers, which brought together the extensive results of a large-scale 


r ing the assessment of teacher behavior. The 
contents of this monumental volume have been summarized briefly by 
this issue, 


concerning teacher supply 
two studies as well as the 


hers recruited the gap is 
i : States. with relatively high standards and those 
states with relatively low standards, Closely related to the fact that those 
states with Higher Fügnis the services of the better 
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reviewed in Chapter II—that the preparation of teachers is being im- 
proved and extended in general education, in the field of academic and 
technical specialization, and in professional education. 

The conclusions that several of the contributors discerningly reached 
from their reviews of the literature were indeed disappointing. In Chapter 
II, Durflinger decried the lack of replication of investigations and the lack 
of any unified or substantial cooperative efforts in the large number of 
reported empirical studies that were concerned with recruitment and selec- 
tion of teachers. Even-less encouraging than Durflinger’s conclusion was 
Gordon’s pointed criticism in Chapter IV of the barren and unimagina- 
lively repetitive nature of research regarding conditions of employment 
and service in elementary and secondary schools—an emptiness that was 
attributed to lack of a clear definition of the role of education and of the 
teacher in American society. In a similar vein, Marks and Dillon noted in 
Chapter V a marked absence of systematic research about problems related 
to teacher personnel in higher education. Although sterility in concept also 
was apparent in those phases of Chapter VI concerned with economic and 
legal status of teachers, some measure of hope could be taken from those 
cited studies regarding the social status of teachers in which role percep- 
tions and image theory afforded a number of promising hypotheses worthy 


of additional investigation. 


New and Promising Areas of Research 


From the reviews of the published literature during the past five years, it 
is with respect to the teaching process itself that the greatest potentiality 
for research lies, as inspection of the Handbook of Research on Teaching 
will readily reveal. In particular, the paradigms for research on teaching 
reported by Gage (1963b) hold great promise for guiding future research. 
Similarly, the numerous models which Ryans described in Chapter Vil 
under the heading of Theory of Instruction and Teacher Behavior consti- 
tute important steps forward both in the development of conceptual frame- 
works within which the teaching-learning process may be viewed and in the 


heuristic values which these models possess. 


Contributions to Theory 


evious paragraph, the construction of comprehen- 
Sive paradigms and models related to the instructional and learning process 
has been an important development—perhaps even a major breakthrough. 
Tt appears that the educational psychologists have moved quickly in direc- 
tions that their colleagues in experimental and theoretical psychology in 
Division 3 of the American Psychological Association (APA) have been 
following for several years. À most encouraging sign has been that the 
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educational psychologists who contributed important chapters to i: 
Handbook of Research on Teaching not only have manifested. a mar A 
degree of sophistication about model building but also have given a 
evidence of their concern with the applications afforded by theoretica 
conceptualizations to problems of teaching in the classroom. It is encour- 
aging to see that all of the highly talented psychologists who are accom- 


plished in theory building do not reside in the exclusive community of 
APA's Division 3. 


New and Promising Procedures of Research 


As mentioned previously, it is in the study of teacher behavior that 
innovations in research methodology have been most prominent. In Chap- 
ter VII, specific attention was given to the criterion problem. Six chapters 
in the Handbook of Research on Teaching were written concer 
ologies of research in teaching. 
that is quantitatively orient 
which research workers will 


ning method- 
For the whole area of educational research 


ed, these chapters will represent keystones 
wish to consult. 


Future of Research in Teacher Personnel 


As of the moment, the recen 
ment of teacher behavior an 


oblem areas of teacher personnel, it may be 


i $ of survey-research studies will continue to be 
made, although it can be hoped that theoretical considerations will eventu- 


ally assume some degree of importance, A fundamental weakness of many 
investigations concerned with legal, economic, and related demographic 
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problems has been neglect of what the aims and purposes of American 
education really are. Value-oriented hypotheses formulated with the aid 
of educational philosophers and historians, psychologists, sociologists, and 
anthropologists could do much to furnish a social framework within 
which status studies could be made increasingly meaningful. 

In view of the uneven rate of growth of the research contributions from 
each of the conventionally viewed areas of teacher personnel, it may be 
advisable in the futuré that two types of issues of the Review be developed. 
First, problems of the assessment of teacher behavior, teacher selection, 
teacher training, and póssibly social status might be in one cycle of publica- 
tion. On the other hand, separate issues of the Review could be devoted 
at irregular intervals to administrative problems concerned with recruit- 
ment; with certification, supply, and demand; and with economic, legal, 
and related demographic characteristics. At the moment, it appears that 
the significance of educational research is somewhat greater in the realm 
of teacher behavior and the instructional process than it is in other topics 
of teacher personnel. Perhaps issues every three years and every six years, 
respectively, would be advisable for reporting (a) research on the problems 
directly related to teaching behavior itself and to the instructional process 
and (b) studies on the relatively mechanistic administrative problems 
underlying the professional activities of teachers. 

Such E, however, may introduce certain risks. Indeed, ihe 
opportunity for transfer effects of research efforts concerning the dews 
of teaching to related administrative procedures would be curtaile 3 
vague, ill-defined unity of topical areas in teacher personnel, sS pe 
be better than no unity at all, could be destroyed. Nevertheless, if the leve 
of sophistication in research concerned with teacher behavior continues 3 
grow as rapidly as it has during the past five years and if - mea 
concerned with other aspects of teacher personnel continues a ollow v 
Same unimaginative, repetitive pattern, the differences in the pes : 
audiences interested in these two broad areas of teacher personnel can be 
expected to become increasingly divergent. Within the next two " id 
Years, the answer may be forthcoming concerning e ee psycho osy 
and sociology of the teaching process has indeed os gop 
eminence apart from other problem areas underlying the globa 


teacher personnel. 


Bibliography 


Gace, N. L., editor. Handbook of Research. on Teaching. Chicago: Rand McNally 
& Co., 1963. 1218 pp. (a) um i a , 

G N E: H ch on Teaching.” Handboo of Research on 
Teci SUY Cee Chicago: Rand McNally & Co., 1963. Chapter 
3, pp. 94-141. (b) . EENE Demand 

N EDUCA x, ResearcH Division. Teacher Supply and Deman« 
in Publi. ON Bed. Research Report 1960-R7. Washington, D.C.: the Associ- 
ation, 1960. 48 pp. 

447 


Review OF EDUCATIONAL RESEARCH Vol. XXXIII, No. 4 


NATIONAL EDUCATION Association, Reseancu Division, Teacher Supply and Demand 
in Public Schools, 1962. Research Report 1962-R8. Washington, D.C.: the Associ- 
ation, 1962. 55 pp. 

Ryans, Dav G. Characteristics of Teachers. Washington, D.C.: American Council 
on Education, 1960. 416 pp. 


Sonts Tayra Ce! kc od i asd to Public Education. New York: Thomas 


448 


— 


INDEX TO VOLUME XXXIII, NO. 4 


Pase Didi ae " , TE 
i age citations are made to single pages; these are often the beginning of a 
chapter, section, or running discussion dealing with a topic. 


Assessment of teacher behavior and in- 
struction: criterion přoblem, 422; fu- 
ture research, 433; general references 
concerning, 416; information system 
model, research literature viewed 
within framework of, 420; interpersonal 
relations in classroom, 424; limitations 
in findings, 431; methodology, 422; ob- 
servation, 423; overview of trends in 
research, 415; scheme for classifying 
teacher behaviors, 423; teacher effec- 
tiveness, 431; theory of instruction and 
teacher behavior, 419 

Autonomy of teachers: 402 


Campus laboratory school: 377 

Certification: levels of responsibilities, 
344; state-by-state variations, 343; 
teacher examinations, 345; trends, 346; 
variations in, 344 

College instruction: research in, 431 

Conditions of employment and service in 
elementary and secondary schools: eco- 
nomic status and economic conditions 
of employment, 386; employment, as- 
signment, and turnover, 385; evalua- 
tion, retention, and dismissal, 385: 
needed research in, 388; staff organi- 
zation and utilization, 383; teacher 
load, 384; teacher morale and job 
satisfaction, 387; teacher status and 
role expectations, 382 

Contributions to theory: 445 

Criterion problem: in assessment of 
teaching, 422 


Description of teacher behavior based 
on: direct observation, 428: inferences, 
428 

Dismissal of teachers: 385 


Economic status and economic conditions 
of employment for teachers: 386 
lementary school teacher education: 370 
mployment. assignment. and turnover: 
385 T 


Evaluation: in elementary and second- 
ary schools, 385; in higher education, 
392; of research reviewed in issue, 442 


Factor studies: contributions to identifi- 
cation of characteristics in training and 
selection of teachers, 364 

Future of research in teacher personnel: 


446 


High standards needed in selecting pub- 
lications for the Review: 443 


Information system model of teaching: 
external inputs—conditions external to 
the teacher, 426; internal inputs con- 
tributing to instructor behavior, 424; 
products of instructor behaviors—pupil 
behaviors, 429; teacher behaviors—out- 
puts of teacher information processing. 
427; teacher information processing 
behavior, 426 

In-service education: of teachers, 373 

Instructor, or teacher, behavior: teacher- 


student variables in, 424 


Legal status of teachers: certification, 
tenure, and autonomy, pertaining to, 
402; civil rights, regarding, 403; com- 
pensation, residual rights, and royalties 
in relation to television, radio, and 
programed instruction, 404; retirement, 
with reference to, 405; teacher-board 
relations, concerning, 404; teacher- 
pupil relations, concerning, 404 

Load for teachers: 384 


Methodology in assessment of teacher 


behavior and instruction: criterion 
problem, 422; interpersonal relations 
in the classroom, 424; observation proc- 
esses, 423; scheme for classifying 
teacher behaviors, 423 
Morale, opinions. and attitudes: of 
teachers, 409 


449 


REVIEW OF EDUCATIONAL RESEARCH 


Need and preparation of teachers: in 
higher education, 391 . 
New and promising areas of research: in 

teacher personnel, 445 
New and promising procedures of re- 
search: in teacher personnel, 446 
New media: in teacher education, 374 


Perception-diagnosis theory of teaching: 
419 

Personality characteristics: in 
selection, 361 

Pre-service programs: 370 

Pre-service and in-service education for 
teachers: campus laboratory school, 
role of, 377; elementary teachers, con- 
cerning, 370; in-service Programs, 373; 
new media, with reference to, 374; 
needed research in, 378; Professional 
laboratory experiences, concerning, 
375; secondary teachers, 
371 


teacher 


concerning, 


Published works: of critical significance 
in area of teacher Personnel, 443 
Pupil behaviors: products of instructor 


behaviors, 429; pupil achievement and 
learning, 430 


Recruitment: age at 
be teacher, 356; 
making, 356 
ecruitment and s 
tive elementary a 
teachers: factor Studies, 364; recruit. 


» 355; selection, factors 
Pertaining to, 358 


Research needed concerning: conditions 
of employment and service in elemen- 
tary and secondary schools, 388; pre. 
service and in-service teacher education, 
378; teacher certification, Supply, and 


demand, 351; understanding teacher 
behavior, 433 


Retention of teachers: 395 


making decision to 
factors in decision 


Salaries and other benefits; 
tary and secondary educati 
higher education, 394; for teachers in 
general, 398; for women te 


achers, 399; 
in relation to additional employment, 
400 


in elemen. 
on, 386; in 


450 


Scheme for classifying teacher behaviors: 
423 . i 
Secondary school teacher education: 37] 
Selection, factors pertaining to: academic 
competence, 359; career commitment, 
361; criteria, 358; personality charac- 
teristics, 361 . 
Social status of teachers in relation to: 
educational level, 406; images and 
Stereotypes of teachers and teaching, 
407; occupation of father, 407 . 
Staff organization and utilization: 383 
Summary of teacher-student variables: as 
considered in the literature, 421 : 
Supply and demand: economic factors, 
347; needed research in, 351; remedies 
for the shortage, 350; subject and grade 
level factors, 348; other factors, 310 


Teacher as problem solver: model for, 421 

Teacher attitudes and opinions: 409 

Teacher-board relations: legal aspects of. 
404 

Teacher certification, supply, and dc- 
mand: 343 

Teacher education: 369 

Teacher effectiveness: 431 

Teacher examinations: 345 

Teacher load: 384 

Teacher morale: 397 n 

Teacher personnel research in ihe 
education: need and preparation 0 
teachers, 391; new data sources con- 
cerning, 396; ratings and evaluation 
in, 392; salaries and other benefits, 
pertaining to, 394 t 

Teacher-pupil relations: legal aspects of, 
404 


Teacher status and role expectations: 382 
Teacher-student variables in relation to 
Systems approach: 424 i 9 
Teaching as system of social action: 41 
Tenure of teachers: 402 . 
Theory of instruction and teacher be: 
havior: contributions in general, es. 
information system model, 420; model 
viewing teacher as problem solver, 42] ; 
perception-diagnosis theory of teaching. 


419; teaching as system of social action, 
419 


CE 


REVIEW OF EDUCATIONAL RESEARCH 


Official Publication of the American Educational Research Association 
Contents are listed in the Education Index. 


Copyright 1963 
By National Education Association of the United States, Washington, D.C. 


>_—_— 


Volume XXXIII, No. 5 December 1963 


ii, Si 


Statistical Methodology 


Reviews the literature for the three-year period 
since the issuance of Vol. XXX, No. 5, December 1960. 


TABLE OF CONTENTS 


Chapter Page 
EGREWOrd aeree BEERE E a a E E ieuete minui 454 
I. Hypothesis Testing in Relation to Statistical Methodology. . . . 455 
Cuerry Ann Crank, The Meyers Clinic, Los Angeles, and Claremont 
Graduate School, Claremont, California 
II. Selected Contributions to Parametric and Nonparametric Sta- 
A cearczecateseme bs d MSS e leid e eniti TM 474 
Wittram B. Micmac, University of California, Santa Barbara, 
California 
III. Developments in Analysis of Variance and Design of Experi- 
MERES as or ARA v ona MAI dte disc AIRS TEA 490 
Dee W. Norton, University of lowa, Iowa City, Iowa 
. 501 


IV. Regression and Correlation. i.e eee e etter eees 
Wituiam J. Moonan, U.S. Naval Personnel Research Activity, San 


Diego, California A " 
dong H 2 hee U.S. Naval Personnel Research Activity, San 


Diego. California 


451 


Chapter 
V. Factor Analytic Methodology 
Pm R. Merrirterp, System Development Corporation, Santa 
Monica, California 


Norman Curr, University of Southern California, Los Angeles, 
California 


VI. Design of Sample Surveys in Education 
Francis G. Cornett, Educational Research Services, Inc., White 
Plains, New York 


Eucene P. McLoone, Office of Education, U.S. Department of Health, 
Education, and Welfare, Washington, D.C. 


VII. Scaling and Measurement Theory 


Hanorp SCHIFFMAN 
Jersey 
SAMUEL Messick 


» Educational Testing Service, Princeton, New 
» Educational Testing Service, Princeton, New Jersey 


VIIT. Statistical Learning Models 


Rowarp A. Weitzman, Los 
California 


Angeles State College, Los Angeles, 


IX. Decision Theory and Utility Management 


RonEnT W. Eart, Claremont Graduate School, 


Claremont, California 


X. Use of Computers in Educational Research 


Frank B. Baker, University of Wisconsin, M 


adison, Wisconsin 


LM C a 


This issue of the Review was prepared by the 
Committee on Statistical Methodology 


William B. Michael, Chairman, University of California, Santa Barbara, 
California 

Raymond O. Collier, University of Minnesota, Minneapolis, Minnesota 

Henry F. Kaiser, University of Illinois, Urbana, Illinois 

Julian Stanley, University of Wisconsin, Madison, Wisconsin 


with the assistance of 


Frank B. Baker, University of Wisconsin, Madison, Wisconsin 

Cherry Ann Clark, The Meyers Clinic, Los Angeles, and Claremont Gradu- 
ate School, Claremont, California 

Norman Cliff, University of Southern California, 

Francis G. Cornell, Educational Research Servic 
New York 

Robert W. Earl, Claremont Graduate School, Claremont, California 

Eugene P. McLoone, Office of Education, U.S. Department of Health, 
Education, and Welfare, Washington, D.C. 

Philip R. Merrifield, System Development Corporation, 


California 
Samuel Messick, Educational Testing Service, Princeton, New Jersey 
William J. Moonan, U.S. Naval Personnel Research Activity, San Diego. 
California 
Dee W. Norton, University 
Harold Schiffman, Educational Testi 
Ronald A. Weitzman, Los Angeles State Coll 
John H. Wolfe, U.S. Naval Personnel Research 


fornia 


Los Angeles, California 
es, Inc., White Plains, 


Santa Monica, 


of Iowa, Iowa City, Iowa 

ng Service, Princeton, New Jersey 
ege, Los Angeles, California 
Activity, San Diego, Cali- 


453 


FOREWORD 


It has been nine years since there has been an issue of the t 
devoted exclusively to statistical methodology. The rate at which p» i P 
tion is increasing is nothing short of astounding. Likewise, the eve, 7 
mathematical maturity required of the reader of statistical mes ae ict 
rising at a correspondingly rapid rate. One dare not think what the status 
of statistical methodology will be by the year 1972! " 

Despite the almost formidable advances in staiistical knowledge, ue 
author or authors of each chapter have made a sincere attempt to mores 
the conceptual significance of the methodological developments with v 
they have been dealing. To translate concepts from a largely mathematica 
formulation to a verbal medium inevitably introduces errors in communi- 
cation—especially in the direction of oversimplification. Since most readers 
of the REVIEW have not specialized in statistics, the attempt to convey in 
verbal terms the major developments in statistical methodology seems to 
be a desirable and defensible objective. How well the writers of the current 
issue have succeeded must await the reaction of the readers. , 

It should be noted that since chapters on statistical methodology, experi- 
mental design, scaling theory, and data processing appeared in the Decem- 
ber 1957 and the December 1960 issues of the Review, not all chapters 
in the present issue furnish coverage for a nine-year period. Chapters D 
VIII, and IX afford a review of contributions during the nine-year pee 
from 1954 to 1963, although relatively greater emphasis has been placet 
on the developments of the past three years in contrast to that put on 
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CHAPTER I 


Hypothesis Testing in Relation to 
Statistical Methodology 


CHERRY ANN CLARK 


? 


Tire snorrcomsos in the methodology of statistical hypothesis testing 
used in educational and psychological research have been emphasized 
repeatedly in recent behavioral science and statistical literature. (Binder, 
1963; Edwards, Lindman, and Savage, 1963; Grant, 1962; Lubin, 1962; 
MeNemar, 1960; Mowrer, 1960; Nunnally, 1960; Rozeboom, 1960; Savage, 
1957). This chapter reviews the salient points of the criticisms. Since 
this issue of the Review marks the first time an entire chapter has been 
devoted to the statistical methodology of hypothesis testing, a brief account 
of several theories of statistical inference is included to provide a back- 
ground for evaluating the rationales of significance tests compared with 
other methods for statistical inferences, as well as the modifications in 
the function of the null hypothesis in testing statistical hypotheses. The 
problems in statistical inference which have been associated with the 
widespread use of significance tests are reviewed. The limitations in and 
the effectiveness of significance tests as methods for informative inference 
are described. The applications of interval estimation contrasted with 
Significance tests in the investigation of hypotheses and models and in 
the development of a body of empirical data are summarized. 


Hypothesis Testing and Statistical Inference 


and complex problem in the methodology 
ictions or the predictions from 
ata to eliminate unsatisfactory 


Hypothesis testing is a central 
of science, It involves comparing the dedu 
Scientific hypotheses with observational d 
hypotheses and to give support,to satisfactory ones. . 
. Å primary objective of hypothesis testing methodology is the formula- 
lion of rules or criteria, sometimes called decision procedures, to use in 
determining whether the data should be construed as rejecting or accepting 
the hypothesis under. investigation. Decision procedures are judged by 
Such criteria as consistency. relevance, completeness. and effectiveness. 

ecision rulés are formulated to provide objective. reliable, and valid 
solutions to important problems in the analysis of data (Buehler, 1959; 


Hotelling, 1958; Jeffreys, 1957. 1961; Tukey, 1962). The hypotheses to 


he tested must be examined for their logical validity or consistency. their 


Yeuristic value, and their amenability to empirical test. The testing of 
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hypotheses requires that appropriate controls be employed e. pe 
observations and in analyzing the data. Systematic errors in the ni 
mental procedures must be minimized if decision rules are to be effici 
(Tukey, 1954, 1960a, 1962). u 
Statistical hypothesis testing is a special instance of the general a 
tific method of testing hypotheses. As in the general method, statistica 
hypothesis testing combines deductive and inductive methods in im 
ways. l'or the most part, statistical hypothesis testing has been concerne 
with determining which one of a dichotomous set of mutually exclusive 
and exhaustive hypotheses is to be rejected and which one accepted ata 
specified level of risk in making an erroneous conclusion or decision. 
Probability theory provides the deductive foundations for theories of 
statistical inference (Hotelling, 1958). Natural phenomena which are pre- 
sumed to be subject to random fluctuations, either because of a stochastic 
process characterizing their behavior or because of random errors in 
observing or measuring them, provide the inductive basis for statistical 
inference. Both enumerative and eliminative methods of induction are 
used in treating the data and in formulating statistical generalizations. 
The question of whether statistical methods based on eliminative or 
enumerative rules of induction constitute the best foundation for statistical 
inference is widely disputed (Rozeboom, 1960; Savage and others, 1962). 
The argument about whether scientific knowledge advances primarily by 
accumulation of positive instances of phenomena to support a hypothesis 
or by negative instances to refute a hypothesis is the focus of some of the 
criticism of the classical theories of statistical inference—namely, the 
Fisher and Neyman-Pearson theories (Savage, 1961). Another argument 
in the current controversy on the foundations of statistics focuses on the 
question of whether there is a sound logical basis for assigning a direct 
probability to a set of propositions or hypotheses. The frequency 9! 
objective school has denied the legitimacy of such a procedure. Probability, 
for this school, refers to the relative frequency of events within a class 
of random recurrences (Hotelling, 1958), not to the amount of support 
for or the degree of belief in a hypothesis. On the other hand, the sub- 
jective probability school, among others, has argued that a fruitful an 
consistent way to proceed with statistical inferences is by assigning ? 
degree of belief to a statement based upon available information and then 
by altering the amount of uncertainty about the statement in the light a 
additional experimental evidence by the use of Bayes’s theorem (Lindley; 
1953, 1961; Rozeboom, 1960; Savage, 1961). d 
Another aspect of the controversy is how the likelihood function shoul 
be incorporated into statistical theory and practice. One of the serious 
limitations in the classical theories of significance tests is chat they are 
not dependent upon the likelihood function (Birnbaum, 1962). F 
Statistical hypotheses are more restricted than are scientific hypotheses: 
just as statistical inferences are more restricted than scientific inferences 
(Bolles, 1962; Cox, 1958b; Tukey, 1960a). Statistical hypotheses con- 
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cern the behavior of observable random variables, whereas scientific 
hypotheses treat the phenomena of nature and man. A null hypothe- 
sis is a particular statistical hypothesis which refers to the theoretical 
probability distribution governing the random variable(s) under investi- 
gation and is the hypothesis under test (Kendall and Stuart, 1961). Sta- 
tistical inference, broadly defined, deals with statements about statistical 
populations made from given observations with a measured degree of 
uncertainty (Cox, 1958b). Statistical inference proceeds from observa- 
t the populations sampled. Scientific inference, 
populations 
estigation. 
a general 


tions to conclusions abou 
on the other hand, often argues from descriptive facts about 
to an abstraction about the system of phenomena under inv 


A statistical inference frequently is instrumental in formulating 


inference, but usually it constitutes only a small part of the uncertainty 
Cox, 1958b; Tukey, 1960a). 


connected with the scientific inference ( 
Decision procedures used in making statistical inferences, including 
those that aid in the selection of statistical design and analysis (Cox. 
1958a; Savage, 1961), require that the investigator use his judgment in 
diverse ways. How such judgment should be allowed to affect the use of the 
decision procedures is a crucial issue in the present controversy on the 
foundations of statistical inference (Savage and others, 1962; Tukey, 
1954, 1960a, 1962). Much of the criticism of the rationales for significance 
tests has been aimed at the inconsistencies in the performance of the 
decision procedures (Berkson, 1942; Birnbaum, 1962; Cox, 1958b; 
Edwards, Lindman, and Savage, 1963; Pratt, 1961a, b; Savage and others, 
1962). 
of Theories of Statistical Inference 
and Significance Tests 
preceded the development of any systematic 
theories of statistical inference. Barnard (Savage and others, 1962) and 
Lehmann (1959) have mentioned their use by two early probability 
theorists, Daniel Bernoulli and Pierre Simon de Laplace. The principal 
Components of significance tesls were contained in their mathematical 
Studies on the departures of planetary orbits from specific mathematical 
models, The essential components were the theoretical or mathematical 
hypotheses and equations and the observations on the extent of the depar- 
tures of the planetary planes from the expected values. The significant 
epartures were those that were considered unlikely on a particular 
hypothesis, When Bernoulli found that the values he computed were 
uniformly improbable on the hypotheses, he concluded that the observa- 
Hons provided evidence for rejecting the initial hypotheses. He did not 
consider an alternative hypothesis, nor did he have a particular degree of 
departure in mind before he made his computations. . f 
Karl Pearson’s work on goodness of fit tests and the chi square distri- 
ution and W. S. Gossett's derivation of the student distribution, a small 
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x P eee 3 : - subse: 
sample exact-sampling distribution, were important influences on sub 
quent developments in the theory of significance tests, 


Contributions of Fisher to Statistical Inference 


Sir Ronald Fisher has been called the founder of modern item: 
theory. He propounded many basic concepts and raethods and developed 
the first complete theory of ststistical inference (Hotelling, 1951: Pearson. 
1962; Yates, 1951). Among his contributions to tne theory of significance 
tests were a systematic rationale for critical ratio tests based on the 
Central Limit Theorem and the Law of Large Numbers: exact small sample 
tests for which he derived many theoretical sampling distributions: and 
randomization or permulation tests to be used in complex experimental 
designs when the assumptions underlying other significance tests proge- 
dures are not fulfilled. He developed interval estimation methods, which 
culminated in his theory of fiducial inference with its fiducial intervals. 


He introduced an extensive formal theory of experimental and statistical 


design and related methods of analysis, including the analyses of variance 
and covariance, He set forth th 


he method of maximum likelihood as a 
foundation for statistical inference and as a basis for developing many 
statistical techniques. He discussed the concept of suficient statistics. 
which has been used extensively in modern statistical theories. He argued 


that inverse probability has no place in modern science or statistical theory 
(Fisher, 1956, 1960). 


e originated the term null hypothesis. He presented significance tests 
of null hypotheses as examples of a logical disjunction. In this view, the 
be proved by experimentation. On the con- 
sts only to be rejected by a sufficiently wes 
experimenter interested in accepting the nul 

ng in determining the uniformity of experi- 
mental procedures (Fisher, 1960), Fisher discussed the combination of 
ith. significance tests after the initial phases © 
en completed, 
Hotelling (1951) and Yates (1951), in evaluating Fisher's impact upon 


t ; s ve 
n the social sciences, suggested that his books han 

ot : - examp 
not had an altogether positive influence. Psychologists, for examp'e: 
have been content to 


: st 
en 4 publish Very limited reports on one or at im 
several significance tests based on null hypotheses of no difference (Grant, 

J. ery 
1962; Nunnally, 1960; Rozeboom, 1960). Such statistical tests have very 


little informative value, They often are known to be false prior to expert 
mentation (Savage, 1954), 


Contributions of Neyman and Pearson to Statistical Inference 


Jerzy Neyman and Egon 
on the development of m 
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Some of their work was an extension of Fisher’s contributions, and some 
of it was a reaction to Fisher’s proposals. They expanded and systematized 
the rationale of significance tests. They argued for the necessity of care- 
fully considering the alternative to the null hypothesis in making any 
lest of significance. They introduced the notions of errors of the first and 
second kinds, the power of a test in discriminating between the hypothesis 
under test and the alternative, and the comparison of the power functions 
of different tests to aid in the selection of the most satisfactory statistical 
test. They formulated an, extensive rationale tor the comparison and deriva- 
tion of different tests in terms of their “nice properties,” such as the 
uniformly most powerful test, most powerful tests, and unbiased tests. 
They expanded Fisher’s maximum likelihood method into the test cri- 
terion called the Neyman-Pearson lemma, which they used to develop tests 
of simple statistical hypotheses. They gave considerable attention to the 
likelihood ratio in the development of likelihood ratio tests, including 
both univariate and multivariate tests. They pointed out the great com- 
plexity of the problems in deriving suitable test criteria for complex 
hypotheses as contrasted with simple hypotheses. They, like Fisher, demon- 
strated the wide variety of statistical tests to be derived from the maximum 
likelihood method. n 

They posited that the size of the sample should be set before experi- 
mentation is begun. The decision rules for tests of significance within their 
theory are based upon hypothetical repetitions of similar size random 
samples from the population(s) under investigation. Determination of 
the size of the sample is a function both of the size of the critical region 
and of the sensitivity or power of the test in detecting departures from 
the null hypothesis. They conceptualized the error rate in significance 
tests as the probability of obtaining a given or more extreme level of 
significance in an extended series of random fixed sample size experiments. 
They reasoned that a significance test is a decision procedure to be used 
as an aid in deciding which one of two actions to take in the face of 


uncertainty (Lehmann, 1959). 


— i 
In their exposition of the theory of significance tests, Neyman and 


Pearson incorporated the spirit of probabilistic reasoning for a two- 
Valued problem. Theoretically, the null hypothesis and the alternative 
hypothesis have equal status: the data are to determine which hypothesis 
is more probable. The error of the first kind is considered ke more 
Serious; it is determined by the size of the critical region or the = 
of significance. The error of the second kind cannot be controlled di- 
rectly by the experimenter, but is a function of the size of the critical 
region and of the characteristics of the power function of the test in 
relation to ihe specific alternative. hypothesis under E per ne In 
actuality, the null hypothesis is rarely found to be acceptable PA he Tomum. 
Carson formulation, for as the sample size increases, a » pe yeni 
tive test rejects the null hypothesis at a decreasing level of significance 

endail and Stuart, 1961; Pratt, 1961b). This fact has been of grave 
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concern to a number of statisticians (Savage and others, 1962). uw hes 
in addition to certain other inconsistencies in the performance oft he bee 
sions rule, has evoked sharp criticism of this classical method for statis 
inference (Pratt, 1961b). . 

nthe ecu ent theory indicated the relationship see m 
of significance and the method of interval estimation called con ur " 
intervals (Berkson, 1942; Birnbaum, 1961; Bulmer, 1957; Kenda À : 
Stuart, 1961; Lehmann, 1959; Natrella, 1960; Pratt, 196la). A Won oa 
significance is concerned with only those values specified by the hypot m 
under consideration, while confidence intervals by their width and by 
the level of confidence indicate the multitude of statistical hypotheses 
which are acceptable as well as how acceptable each hypothesis is by n 
location within the confidence interval. The Neyman-Pearson rationale 
showed the practical and theoretical equivalence of tests of significance 
accompanied by the power function or the operating characteristic aire 
and confidence intervals (Natrella, 1960) . Pratt (1961a, b) reminded statis- 
ticians of one important shortcoming of the theory of confidence inter- 
vals: the probability is either one or zero that the value of interest is 
contained within the confidence interval; moreover, the formulation is 
based upon the frequency interpretation of probability, which does not 


allow for a straightforward degree of confidence in the likelihood of the 
obtained statistic. 


Educational and psychological inv. 
the Neyman-Pearson formulatio 
interval estimation. 


estigators have rarely used ging 
H : " a T 
n either for significance testing or fo 


Contributions of Wald to Statistical Inference 


In the short but exce 
statistics, he made ma 
tistical hypothesis testing. 
into statistical theory an 
selected samples of varyi 
design and analysis to 
Pearson method. He d 


ptionally productive time that Wald contributed to 


t 
Wo constants are selected for er jos 
£ht to the probability of each o 


In his Statistical Decision Functions, Wald (1950) gave explicit m 
sideration to the statistician's role as 4 decision maker by his detal 
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specifications of the various factors of the decision situation. Previously 
Neyman and Pearson had given indirect consideration to the possible 
losses consequent to a wrong decision by the different values to be selected 
for the two kinds of errors. Instead of basing the decision rule upon the 
specification of error rates as Fisher and Neyman and Pearson had done, 
Wald gave formal recognition to the assessment of losses involved in 
making wrong decisions (Bahadur and Robbins, 1950; Lindley, 1953, 
1961). c 

The formal matheniatical structure of statistical decision theory is 
based upon the theory of games (Luce and Raiffa, 1957). It requires the 
statistical consumer or investigator to be able to list a set of possible 
d action. The action preferred depends 
is, in other words, upon the unknown 
s among the actions are assessed 
ble action to be taken in the 
ch may be referred 


actions, among which is a preferre 
on what the “true state of nature” 
value of some parameter. The preference 
in terms of the losses attached to each possi 
presence of each state of nature (the circumstance whi 
to as the loss function). The decision maker can obtain information 
about the state of nature at some specified cost for the observations made; 
he must balance cost against possible losses in the face of insufficient 
information. The decision function, then, depends upon the given loss 
function, the set of possible actions, the possible states of nature, the prior 
information available about the state of nature, and the cost of making 
observations (Savage, 1954). 

Point estimation, interval estimation, and hypothesis testing are sub- 
sumed within a single theory, known as statistical hypothesis testing. The 
problem of choosing a design and an analysis is integrated within the 
theory and is closely associated with the factors determining the choice 


of the statistical decision function. 


Another noteworthy contribution o | wa: ition 
there is not a unique decision procedure which is equally effective in all 


conditions, He showed that in those cases where prior information is 
available, a Bayes solution is admissible and optimum. There are situa- 
tions, he pointed out, in which a Bayes solution is not part of an admis- 
sible strategy. He also raised the question of formulating a class of 
admissible hypotheses, and he thereby clarified some of the issues con- 
nected with composite hypotheses in the Neyman-Pearson formulation. 

The theory of testing statistical hypotheses provides a decision pro- 
cedure for working with multiple hypotheses rather than merely with 
two hypotheses, as in nificance tests. For example, given 


the theory of sig 
three hypotheses, inclu 


f Wald was his demonstration that 


ding a null hypothesis and two alternative hypotheses 
at different distances on either side of the null hypothesis, an optimum 
statistical decision function often is available to select the most probable 
of the three hypotheses on the available data (Lehmann, 1959). 

The mathematical complexity of statistical decision theory and the 
great amount of information which is required to use it have discouraged 
its application in behavioral science research. Its theoretical formulations 
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; — is att 
have helped to clarify a number of problems in the conventional use o 
1a y 
significance tests (Kaiser. 1960). 
sig 


Contributions of Bayesian Statistical 
Theory to Statistical Inference 


Bayesian statistical theory, ‘for the most parl, is not so oe pe 
significance tests for informative statistical inferences as is classits 
statistical theory, namely the theories of Fisher and Neyman and Pearson. 
With rare exceptions, the British-American school of statistics has on 
firmly entrenched in the objective or frequency interpretation of pro d 
bility and has eschewed the use of Bayes's theorem or the likelihood func- 
tion as bases for formulating statistica] inferences. The Continental "eek 
on the other hand, has been concerned with giving formal mathematica 
treatment to the subjective theory of probability or with developing statis- 
tical methods based on the use of Bayes’s theorem (Savage, 1954). u 

One British scientist working somewhat independently of the British- 
American school, his work unrecognized until recently by the majority of 
American statisticians, has made many important technical advances in 
the use of Bayes's theorem, Jeffreys (1957, 1961) has treated the c 
of significance tests in à unique manner. Tests of significance are usec 
to assess whether the hypothesized parameter(s) can be considered to 
account adequately for the obtained dat 
posterior probability distribution to the 
the prior probability distribution, is not sa 
ter is introduced into the model. The proc 
null hypothesis is continued until 
that is, within the limi 
measurement, Jeffre 


a satisfactory approximation is aunimen 
ts of random sampling variation and errors o 
ys has emphasized the importance of refining measure- 
cement of scientific knowledge. For Jeffreys 
ientific investigation has been the verification 
es. Perusal of the computations required a 

for t-tests of significance would dismay the 


average social scientist accustomed to t-tests without consideration of 
prior probability distributions, : 


The merits of Jeffreys? work have been recognized by Lindley (1953. 
1961), Raiffa and § 


chlaifer (1961), Savage (1954, 1961), and Savage 
and others (1962). H he French and Italian probability 
theorists have influenced the Surgence of Bayesian’ statistics, especially 
Bayesian statistical decision theory in the United States (Raiffa and 
Schlaifer, 1961; Roberts, 1962; Schlaifer, 1959), m 
In his compendious discussion Concerning the foundations of statistics 
of ,10 years ago, Savage (Savage, 1954. Edwards, 1956) argued ge 
various aspects of the classical theories of statistics could be reinforce 
work relevant to the behavior of the 
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scientist and the decision maker by adopting the personal interpretation 
of probability. At that time, he critically evaluated many of the conven- 
tional practices in the use of statistical methods, especially the widespread 
use of extreme null hypotheses which are known to be false prior to ex- 
perimentation. Subsequently, Lindley (1961) and Raiffa and Schlaifer 
(1961) have built upon the work of Jeffreys a still incomplete structure of 
statistical theory and methods using Bayes’s theorem, Edwards. Lindman, 
and Savage (1963) have presented to psychologists some of the funda- 
mental ideas and techniques of Bayesian statistics. 

Bayesian statisticians have pointed out that classical statistical methods 
do not make full use of available information and that they do not pro- 
vide concise and relevant answers to the questions of greatest importance 
to investigators. Furthermore, in many situations the decision procedures 
do not meet the basic criterion of consistency (Pratt, 1961b). 

Classical and Bayesian formulations have points in common. Both have 
the following elements: alternative hypotheses or acts, possible parameter 
values or states, the possibility of sampling (experimentation) to obtain 
information about the parameter value or state, the sampling or experi- 
mental results in the form of descriptive statistics, and either loss structures 
or error rates. Both seek the best decision rule which will minimize loss 
or error in the decisions or conclusions made following experimentation. 

The classical decision maker chooses an act on the basis of the outcome 
of a sample which is conditional upon the parameter and the type of 
sample and experiment. Within the decision rules for significance tests, 
the so-called conditionality of the experiment and of the sample is not 
the statistician must use his judgment to give 


adequately represented; t 
: deo ditions when he interprets the obtained level 


appropriate weight to the con 


of significance (Cox, 1958b). . 
The Bayesian statistician pursues a different course. He begins by mak- 


ing probabilistic statements about the parameter under investigation. The 
probabilistic statements are in the form of a prior distribution. Then, de- 
pendent upon how precise he wishes to be in his final probabilistic state- 
ments about the value of the parameter, he sets a limit, or an upper bound, 
in the form of an expected value on the cost or worth of experimental data 
act or decision. From this point of view he decides 
whether it is worthwhile to sample. As sampling information becomes 
available, the Bayesian modifies his prior distribution in the light of 
sample evidence and thereby obtains a posterior distribution. By using 
some specified decision rule, he may decide that he has sufficient informa- 
tion to come to a decision or a conclusion; or he may decide that his 
uncertainty has not been reduced enough and that, therefore, he must 
continue sampling until he has a suitable basis for a terminal act or 
conclusion (Raiffa and Schlaifer, 1961; Roberts, 1962). 

The Bayesian analysis formalizes many aspects of statistical practice 
which classical methods leave to the judgment of the investigator. For 
the orthodox contention that there is often little objective basis for arriv- 


Apo 


for selecting a terminal 
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ing at prior probability distributions, Bayesians have countered puc den 
at least a number of situations the prior distribution assumes little weigh 
in the final or posterior distribution (Edwards, Lindman, and Savage, 
1963; Savage and others, 1962). Several interesting solutions to this 
problem have been indicated by Raiffa and Schlaifer (1961). : 
Edwards, Lindman, and Savage (1963), Lindley (1961), and Savage 
and others (1962) have discussed some of the advantages of Dayesian 
significance tests over classical tests. The first-mentioned authors have dis- 
cussed the importance in many educational and psychological research 
problems of taking possible losses or gains and prior information into 
account. For example, classical t-tests and F-tests have been used to study 
the differential effects on several groups of several methods of instruction. 
In neither approach has it been practical to take into account considera- 
tions of previously available information about the differential effective- 
ness of one method versus another; nor has it been customary to give 
attention to various kinds of losses, economic or learning, which might 
be associated with the adoption of one method rather than another. The 


authors have showed how Bayesian procedures can provide appropriate 
answers to such problems, 


The Likelihood Principle and Statistical Inference 


The likelihood princi 
a part of the armament 
it has been given little a 
The likelihood function 


ple, like Bayes’s theorem, has been recognized as 
arium of statistics; but, as with Bayes’s theorem, 
ttention as a primary tool for statistical inference. 
is represented by the distribution function of the 


is determined by the 
specified experiment ( 
(Birnbaum, 1962) 


the problem of having to modify 
confidence levels in the light of 
likelihood function clearly as- 
ntal outcomes which have not 
nt. One of the main objections 
lity distribution for significance 
l possibilities are irrelevant for 
tual observations. The likelihood 
pendent upon the sequence of or 
whereas orthodox significance tests and 
als cannot be used when sampling is stopped arbitrarily. 
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Use of Significance Tests for Statistical Inference 


The test of significant difference has been called the prototype of modern 
experimental statistics (Tukey, 1960a). It is a qualitative rather than a 
quantitative procedure for statistical analysis and inference which is used 
to answer questions such as this: Dare we conclude that the difference is 
not zero? A classical significance test assays whether the two statistical 
hypotheses A and B are equal or whether A is less than or greater than 
B. The failure to obtain a significant difference does not warrant the con- 
clusion that A is equal to B until careful consideration is given to the 
specific experimental situation, the available evidence, and the assessment 
of the consequences for such a decision or conclusion (Tukey, 1960a) . The 
precision of the experimental comparison, the power of the statistical test. 
and the theoretical and the empirical closeness of A to B must be examined 
before the lack of statistical significance is interpreted as a positive find- 
ing (Cox, 1958a; Tukey, 1960a). The effectiveness of confidence intervals 
in showing the probable relationship between A and B indicates their 
superiority over significance tests for informative inference. Statisticians 
have given a great deal of attention to the use and the misuse of signifi- 
cance tests (Anscombe, 1961; Bahadur and Robbins, 1950; Good, 1958; 
Kish, 1959; Lindley, 1958; Savage, 1957; Selvin, 1957; Sterling, 1959, 
1960; Williams, 1959). There has been general agreement that significance 
tests have been used too frequently in social science research (Grant, 1962; 
Harrington, 1961; Lubin, 1962; Nunnally, 1960; Rozeboom, 1960; Savage, 
1957; and Wilson, 1961) and at the expense of more appropriato, pro- 
cedures, such as confidence and fiducial intervals acp pu po 
1962; Kish, 1959; Lindley, 1958; and Savage and others, 1962). Sig- 
nificance tests without power functions do not answer such gassbnos as 
these: How far is the sample statistic from the null hypothesis? How much 


credence should be given to the null or the alternative hypotheses? 

The situations in which classical significance tests can tyi ics are 
i 5 i fulness of significance 
limi e (1961) has pointed out the use g 
lw po theoretical hypotheses or models. 


ting 

tests as a fundamental method for testing t? ; 1 

For this purpose, the null hypothesis is given Some eretas ihu Mia a o 
» u 


of theoretical and experimental consideration. The aie oe 
determine in a new sample whether the null liypor e " ena 2 i 
observed departures being no more than those expectei dual iin ba. 
random sampling variations. When the Action s d Aou 
Vestigator considers whether another gei ould - e incedere 
the statistical model or whether some other mo — in the ow 
be made. Another use for significance tests occurs p pa m experir ^ e 
wishes to test the adequacy of a particular Api Tai nalis us 
or technique, a particular stochastic process, or an experumt p à 


He is interested in determining whether the methods used produce the 
dedi a cra or uniformity with due allowance for random sampling 
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errors (Anscombe, 1961). Significance tests are also applicable he i a 
investigator is concerned with testing a particular statistical hypo M 
when a set of specific alternatives has not yet been conceptualized (Sav age 
and others, 1962). When a research worker wishes to verify a a 
that an experimental result has been in a specific direction, one-tailed 
significance tests are appropriate (Anscombe, 1961). i 
Significance tests can be informative in later stages of research w m" 
estimation is included in the statistical procedures. The null hypothesis 9 
no difference has been judged"to be no longer a sound or fruitful basis 
for statistical investigation. Both the null and the alternative hypotheses 
are formulated to include estimation of relevant experimental variables 
(Bush, 1963; Grant, 1962: Nunnally, 1960; Tukey, 1960b) . . 
Significance tests can be divided into two groups: those for which a 
set of alternative hypotheses can be defined, such as tests of specific pa- 
rameter values, and those for which a set of mutually exclusive and ex- 
haustive hypotheses are not available, such as tests of randomness. The 
former almost always have comparable interval estimation procedures— 
at least, such procedures can be derived—whereas the latter pose serious 
problems for the derivation of interval estimation procedures. When 
comparable interval estimation procedures are not available, then an in- 


vestigator has no choice but to use significance tests, as in the use of some 


distribution-free methods (Kendall and Stuart, 1961; Savage and others, 
1962). 


There are many statistical problems for which both significance and 
interval estimation procedures are available. Similar answers are obtained 
if the procedures are used effectively and appropriately (Berkson, 1942; 
Birnbaum, 1961; Kendall and Stuart, 1961; Natrella, 1960; Pratt, 1961a). 
Statisticians have agreed that interval estimation does give a more intui- 
tively understandable summary of the data than do significance tests with 
power functions (Kish, 1959; Natrella, 1960). 

Among statistical problems which of 
or interval estimation procedures are 


wherein a family of simple hypotheses 


on the basis of the data, and (b) composite preference problems, wherein 
a family of composite hypotheses is ranked on the basis of the data 
(Lehmann, 1959; Savage and others, 1962). These problems convention- 
ally have been classified under the rubrics of hypothesis testing, point and 
interval estimation, and discrimination (Savage and others, 1962). 


ten can be treated by either testing 
(a) simple preference problems. 
is ranked in order of credibility 


Problems in Statistical Hypothesis Testing 


There are many unsolved problems in the theory and the practice of 
statistical hypothesis testing. Some of the theoretical problems have been 
mentioned at the beginning of this chapter. In the following paragraphs 

` some of the practical problems are summarized. . 
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Decision Theory and Conclusion Theory 


Tukey (1960a) has suggested that some of the misunderstanding sur- 
rounding the use of significance tests can be clarified by distinguishing 
those statistical problems concerned with the necessity of making a choice 
between two alternatives from those concerned with the accumulation of 
evidence in support of a theory or set of working hypotheses. Decision 
theory is typically concerned with problems in which economic losses or 
rewards, a set of possible actions, and thei: consequences in various states 
of nature can be definéd. Decision theory counsels the statistical consumer 
about how to choose wisely among available strategies (Bahadur and 
Robbins, 1950; Flanagan, 1958). 

Conclusion theory, on the other hand, is concerned with evaluating the 
adequacy of evidence. Conclusions need not be made if the evidence is 
deemed inadequate. A conclusion is accepted relative to the conditions of 
pelling evidence to the contrary is found. De- 
cision procedures choose between two hypotheses in terms of minimizing 
the risk for an action, while conclusion procedures often are concerned 
with controlling errors of both the first and the second kinds at suitably 
low levels in choosing between two hypotheses (Anscombe, 1961: Lindley. 


1961; Tukey, 1960a). 


an experiment until com 


e Null Hypothesis 


Rejection or Acceptance of th 
atisticians and behavioral scientists 


Much of the controversy among st ian f dem 
about the theory and the practice of statistical hypothesis testing has 


centered upon whether the null hypothesis can reasonably be accepted. 
Grant (1962) has argued that experiments oriented toward the iani. poe 
of null hypotheses are inappropriate. Binder (1963) has dme 
the Neyman-Pearson formulation "cam i ck nd f € 
hypothesi hough, unless some acju* ment in sig s 7 
ees ut to reject the null hypothesis "€ -— sm die 
large or when powerful tests are used (Pratt, up ). ; d T 
Jection of the null hypothesis by classical methods p IN 
of the Bayesian criticism (Edwards, Lindman, an ge. ; 


and oth : : . xui 
devi E Anscombe (1961), Good “aye Ces ba 
1961) S d others (1962), and Sterling (1960) à T 
that hs avage ant nz should be directed toward testing P. peii 
sl ed bep ierd testing implausible ones that can be rej 
Y a single observation. 


Statistical Significance and Substantive Significance 
do not signify” (Savage and others, 1962) has 
eii roblem in the use of significance 


That “signi level 

gnificance leve 4 
imary P 
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tests. Among the proposed solutions to the problem have been the inclu- 
sion of a distance function to indicate how deviant the data are from the 
null hypothesis (Bulmer, 1957), the use of a form of variance in analysis 
of variance to show how well the independent variables account for the 
obtained results (Bolles and Messick, 1958; Gaito, 1958), the substitution 
of interval estimation for significance tests to give a quantitative repre- 
sentation of the spread of the obtained values (Birnbaum, 1961; Kish. 
1959; Natrella, 1960), the use of Bayesian methods (Edwards, Lindman. 
and Savage, 1963), and the use of the simple likelihood function (Birn- 
baum, 1962). 

Significance tests as methods for informative inference have heen criti- 
cized severely (Birnbaum, 1962; Edwards, Lindman, and Savage. 1963: 
Lubin, 1962; Rozeboom, 1960; Savage and others, 1962). Mere inspection 
of the data often reveals the untenability of the null hypothesis. When there 
are grounds for believing that random sampling variations alone do not 
account for the data, a significance test is not the most appropriate method 
to use (Williams, 1959). Significance tests do not summarize the evidence 
(Birnbaum, 1962; Lindley, 1958; Tukey, 1962; Wilson, 1961). Much 
caution must be used in interpreting significance levels, for the obtained 
level of significance is not consistent from sample to sample or among 
comparable tests (Pratt, 1961b). The obtained level of significance is 
dependent upon conditions which do not modify the obtained level 
(Birnbaum, 1962). There are many conditions which vitiate the accuracy 
of the obtained significance levels, even for robust test procedures (Tukey, 
1962). Often graphical representation indicates the presence of disturbing 
conditions, but social scientists do not use such a procedure regularly 
(Tukey, 1962). Classical significance procedures are not sensitive to the 
size of the error (Wilson, 1961) ; often it is the size of the error that an 
investigator wishes to know to assess his procedures. 

An artificial level of significance is not a sufficient or an efficient indi- 
cation of the presence or absence of evidence; yet editorial policies have 
chosen to ignore this fact in their insistence on extreme levels of signifi- 
cance (Melton, 1962). Publication policies have posed other problems for 
the interpretation of test results. Sterling (1959) has commented upon the 
prevalence of Type I errors among psychological publications associate 
with the suppression of nonsignificant results 
inclination of editors to publish mainly articles with statistically significant 
findings. Cohen (1962) has found that articles on social and abnorma 


psychology have not been appropriately concerned with the power of the 
designs and tests used. 


as a consequence of the 


Significance Tests and Statistical Assumptions 


The assumptions underlying all phases of statistical methods constitute 
pervasive problems in statistical practice (Savage, 1957; Savage and others, 
1962). Rarely can an investigator be certain that the assumptions are ade- 
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quately fulfilled in any particular problem (Tukey, 1960a, 1962). Among 
the important problems regarding assumptions in ordinary statistical 
practice are the following: (a) the appropriateness of the probability 
model to characterize the population (Savage, 1957; Neyman, 1960): 
(b) the inappropriate use of small sample methods before large sample 
methods have been used to make estimates of variance (Nunnally, 1960) : 
(c) the robustness or Sensitivity of statistical methods to violations of such 
assumptions as normality, independence. homogeneity, additivity (Boneau, 
1960): (d) the selection of test procedures with due awareness for the 
performance and the requirements of the procedures and the nature of 
the data (Binder, 1959; Boneau. 1960: Kendall and Stuart, 1961: 


Lehmann, 1959): (e) the use of transformations of data to meet the 


assumptions of the more powerful statistical methods versus the use of the 
less powerful distribution-free and nonparametric methods (Boneau, 1962: 
Lehmann, 1959; Savage, 1957). Always the investigator must remember 
that different tests based on different assumptions when applied to varying 
sets of data lead to different results (Tukey, 1962). Often the precision 
promised is not the precision delivered (Tukey, 1954, 1962). . 
Another basic assumption underlying classical statistical methods is that 
of randomization. Psychologists have been criticized for not being suf- 
ficiently concerned with random sampling procedures. the definition of 
target populations, and the limitations in the representative nature of most 
samples studied (Kish, 1959; McGinnis, 1958; Nunnally, 1960; Stevens, 
1960). McGinnis (1958) and Kish (1959) have discussed the validity of 


ieni : i xperiments. 
Significance tests in surveys Versus well-controlled ex} 


pothesis Testing 


Data Analysis and Statistical Hy 
of current statistical practice has 


Tukey (1962) commented that much 


emphasized the development of elaborate methods at the expense of at- 


tention to the analysis of data. Instead of concentrating upon pein SS 
i rn scientists, statisti- 

nat upon the questions that conce sts, sti 

is oh the daig anil oP commit errors of the third kind— 


cal ie " en prone to ea 
practitioners have been p uestions—which is perhaps the 


that is, giving exact answers to the wrong q n n t 
most is iia the three kinds of errors (Schlaifer, 1959). The routine 


use of significance tests has perpetrated many nr a aor a oe 
as Wientlóned above, significance tests do not s e : nus nde 
Scientists need, and, furthermore they are not ne — ci pal 
for analyzing and summarizing data. Tukey = er m im 
that attention should be given to adapting methods to problems ra her 


than to forcing problems into particular methods. 


Statistics and Experimentation 


Statistical methods have been widely wsp 
PXperimenta] controls are either not 16 


used in the sciences in which strict 
Je or desirable. The usefulness 
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of statistical methods for designing and analyzing investigations has i. 
widely recognized; however, there have been dissenters. Lo T 
Hogben (Hogben, 1957: Stevens, 1960), who have maintaine Ha i 
most appropriate solution for the many problems associated with the hen 
and practice of statistics is the replacement of statistical methods by hei 
controlled experimentation. Lindley (1958), in his response to Hog en 
criticisms, contended that statistical methodology requires extensive modi- 
fications in order to play an.effective part in science, but that vane 
will doubtlessly continue to have an important role in all kinds of research. 

The problems of the relative emphasis which should be given to 


P : MN . : it an 
considerations of statistical versus experimental design does not admit 
easy solution. 


Statistics and Psychological Theory 


If statistics is to make significant contributions to the testing of scientific 
hypotheses, then the methodology of statistical hypothesis testing must 
incorporate theoretical formulations of the subject matter under i = 
tion. Statistical inferences are maximally informative if they are supported 
by other findings and if they are enmeshed in an expanding network of 
hypotheses and constructs. T 

The constitution of a set of mutually exclusive and exhaustive statistica 
hypotheses requires careful consideration of relevant theoretical impen 
tives in order to select heuristically valuable alternatives (Savage ant 
others, 1962). Recognition of the limitations in the theoretical formula- 
ions and in the statistical methods sho 
unwarranted statistical and scie 

Bush (1963) 
ness of fit betwe 


uld motivate prudence against 
ntific inferences (Jeffreys, 1957). 

remarked that theories are not substantiated by the good- 
en the data and a hypothetical statistical model but rather 
by the estimation of significant parameters in the statistical model so tha 
information about the functions of experimental and theoretical variables 


can be accumulated. Estimation is essential for the testing of accurate an 
fruitful theories. 


Conclusion 


While the so-called crisis in statistics (Hogben, 1957; Lindley, 1958; 
Stevens, 1960) has not been resolved, it has highlighted many of the 
shortcomings in such frequently used methods as significance tests, 4 
it has evoked constructive thought about the very foundations of statistica 


inference (Savage and others, 1962) as well as about the rationale © 


statistical methods (Tukey, 1954, 1962). 


: > — 
If educational and psychological research workers heed the admonition: 
and the recommendations of statisticians, 
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M ee practices may be anticipated. At least investigators will be wary 
o the routine application of significance tests as the main basis for 
statistical inference. 
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CHAPTER II 


Selected Contributions to Parametric and 
Nonparametric Statistics 


WILLIAM B. MICHAEL 


A vmovca THERE is a slight duplication of the research reviewed in lus 
chapter with that to be found in Chapters I and III, an effort has been 
made to include those contributions to statistical methodology that were 
thought to be of at least some interest and of some potential help to ee 
search workers in the behavioral sciences, The scope of this particular 
chapter is probably broader and perhaps less unified than that of any 
other in this issue of the Review. In this chapter, the major objective 4 
to furnish for the period from July 1, 1960, to July 1, 1963, a re 
description both of the general developments in statistical inference ane 
of the somewhat specific contributions that have been made to the theory 
and application of parametric and nonparametric tests of significa 
especially those tests involving only two samples. Consideration has also 
been given to citing important books, to describing advances realized in 
the treatment of data in contingency tables, and to mentioning certain 


developments in the methodology of multiple comparisons that have not 


been noted by Norton in Chapter IIT. 


Books 


As one might expect, scores of bo 
appeared since July 1960. In this ch 
which for the most part are somew 
example, books will not be cited 
such specific problem areas as an 
gression and correlation, factor 
measurement theory, statisti 
statistical learning theory, 
analysis, as the authors of 
of the important books in these specific areas, 


For research workers in education and in the behavioral sciences, the 
third revision of the well-known text by McNemar (1962), though not 
much changed, remains as a standard reference text, For a similar ub 
the relatively elementary text by Peatman (1963) should also be pe 
Of particular interest to the behavioral scientist is Horst's (1963) van 
text concerning matrix algebra. For the educational research worker who 
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oks on statistical methodology have 
apter, those books will be mentioned 
hat general in their orientation. por 
which are concerned primarily with 
alysis of variance and covariance, m 
analysis, sample surveys, scaling ia 
cal decision theory and utility pint 
and computer methodology including inen 
the other chapters have made reference to mos 
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wishes an introduction to probability and statistical applications in terms 
of a modern point of view embodying elementary set theory. the volume 
by Mosteller, Rourke, and Thomas (1961) should be especially helpful. 
Requiring a relatively elementary level of mathematical sophistication, 
about the equivalent of one or two years of college mathematics, the texts 
by Alexander (1961), Freund (1960, 1962), and Kozelka (1961) are 
noteworthy. Demanding considerably less mathematical maturity of the 
reader than the four books just cited is a text by Hammond and House- 
holder (1962), which rests heavily upon a highly verbalized, nonmathe- 
matical approach to the understanding of statistics. 
At a high level of mathematical sophistication are the following works: 
the second volume of Kendall and Stuart's (1961) The Advanced Theory 
of Statistics, which is concerned with statistical problems of inference and 
relationship; Walsh's (1962) Handbook of Nonparametric Statistics; and 
Wilks’s (1962) Mathematical Statistics. Probably the most „profound 
treatise concerning the nature of probability is the revised edition of the 
volume by Jeffreys (1961), who assumed the point of view of a philosopher 
of science. Constituting a monumental contribution to the conceptual frame- 
work of probability and statistics is the short but penetrating monograph— 
a collection of papers—by Savage and others (1962) titled The Founda- 
tions of Statistical Inference, to which reference will be made in the next 


section. 


Additional References: ; Savage (1962). 


Sarhan and Greenberg (1962) 


General Contributions to the Theory of 
Statistical Inference 


Three Basic Contributions 


Norton have given attention both to 


" k and 
mne. api Lugd HL, ee ] paper on the foundations of sta- 


Birnbaum’s (1962) general theoretica on nda of 
tistical in in which the likelihood function 15 the unifying principle, 
: 


and to Tukey's (1962) comprehensive paper on ne ee ae ay en 
sis. The accompanying discussion and comments 0 de ne "s in 
cians concerning Birnbaum’s formulation are particulary’ aa bw Sav n 
Chapter I, Clark has cited 20 times the important monograp ^u ver 
and i (1962), in which Savage not oy argie Je n inei 
tail for a theory of statistical eee "p rise S British tiia 
also answered questions from a number o a eee an n 
Who had d alternative pomis esa nt ci lv interested both 
k presente logical research who is seriously interested | 

in educational and Lee oe ons of statistical inference 
and the 


i tual foundati sta 
ìn trying to underst isely the tests of statistical methodology 


apn attempting e Sri great eare the quanta 
iad rerea e » majority o a atively 
ti ula read and re sre will be much that the majority q ) 
ions, Admittedly. there 416 
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oriented research workers in both education and psychology will not grasp 
to their complete satisfaction in view of the uneven nature of the demands 
placed upon the level of mathematical maturity of the reader. Nevertheless, 
there are many key concepts, basic points of view, and evaluative com- 
ments that are not only quite comprehensible but also helpful in broadening 
one’s orientation to the basic problems of statistical methodology. 


Contributions from AERA-Sponsored Publications 


plays in the application of the 


the chapter by Tatsuoka and 
Tiedeman (1963) in the Handbook of Res 


earch on Teaching, edited by 
Gage (1963), should be most helpful to its intended audience. The authors 


es and furnished a helpful table 
nize these statistical techniques 
red for appropriate application, 
ependent variables present. 


Contributions from Psychological Literature 


Z of statistical inference in psychology. 
In the main, they have been written in such a manner that most research- 


oriented psychologists and educational psychologists, irrespective of area 
of major Interests, may be expected t 


the material presented. Since, in Ch 
these contributions Several times a 
their contents, each one will be cite 
pose of furnishing a relatively 


cized what he termed the "null-h 
inappropriate method of infer 
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strategy of experimental investigation of theoretical models, Grant (1962) 
expressed doubts regarding the value of testing the null hypothesis when 
theoretical considerations lead to a prediction of differences of specified 
amount, and he then recommended a procedure involving estimation of 
confidence intervals with respect to the empirical means (averages) that 
can be referred to a theoretical curve. Taking a somewhat contrary posi- 
tion to that of Grant, Binder (1963) emphasized that the optimal design 
of experiments with sufficient built-in sensitivity to allow for acceptance of 
useful theories and for rejection of unsuitable ones is the key point rather 
than Grant's argument concerning the inappropriateness of an experimental 
design in which acceptance of the null hypothesis will suggest support for 
a theory. 

Of marked import 
was the comprehensive and defin 
Savage (1963) concerning the u 
psychological research. They poin 
by the Bayesian statistician may frequently up] 
null hypothesis, whereas use of standard statistic: 
to the rejection of the null hypothesis. 


ance to educational and psychological research workers 
itive paper by Edwards, Lindman, and 
se of Bayesian statistical inference in 
ted to the fact that evidence obtained 
support the acceptance of the 
al procedures would lead 


Contributions from Mathematical Statisticians 
From the mathematical statisticians, three papers nd 
nature deserve brief mention. In an expository article which re 


developments in inferential statistics over à ae af bg LE e 
Pearson (1962) evaluated the work which he eie 1 ds ne to af 
cerning confidence intervals, graciously acknowle Beg Dus i view, iid ex 
other statisticians such as Fisher with differing Paad] ; and rior 
pressed certain reservations about subjectively 1962) r aretha deer 
distributions of Bayesian statistics. Brillinger ( : A red vdsitunb 
requirements for application of the fiducial argumen ds les, and included a 
to Bayesian theory, presented arom i peso paper, Fraser 
comprehensive bibliography of 54 references. in a E 


: | t have been analyzed in terms of the 
(1961) reviewed several apu lap 1 m e fan ionis 


e A ed a num i oir these pre 
ane " gener bn mathematical ini pa which 
>a ; a i 

Probability ipa i could be related to a large class of problem 


Theory of Estimation 


Point Estimation 
ning the theory of estima- 
B t noteworthy paper concer g J i 
genus d Pe A (1961), who extended and — €— 
ee i f view regarding estimation 10 - 
Previgi ions and points oi Vl 
us formulati $5 
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parameter problems. His basic criterion was that of admissibility of a 
point estimator, which was considered in terms of its full distribution 
rather than in terms of its special loss functions. Both theoretical bases 
for characterizing admissible estimators and practical computational meth- 
ods were included. Moreover, Birnbaum succeeded in generalizing Fisher's 
maximum likelihood principle of estimation, gave it exact (or nonasymp- 
totic) justification, unified it with a theory of tests and the theory of 
Neyman-Pearson confidence regions. and described relationships between 
exact and asymptotic results. , 

To furnish an estimate of the common mean from two independent 
sample means as well as estimates of the separate variances of two normal 
populations, Fields, Kramer, and Clunies-Ross (1962) described an itera- 
tive method that yields either maximum likelihood values or quantities 
only slightly different from them. In 
statistics, Hodges and Lehmann (1963 
point estimation, defined point estimates of location or shift in parameter, 
described their success in providing robust power, and proved certain 
properties of invariance and regularity. 


terms of nonparametric rank test 
) explored different approaches to 


Interval Estimation 


During the past three years, much attention has been placed upon the 
length of confidence intervals, 


In the determination of the size of a sample 
in order that there will be a specified probability 1 — a that a confidence 
interval contain a parameter and that there be a specified. probability that 
the width of the confidence interval will be less than a certain designated 
amount, Graybill and Morrison (1960) outlined two requirements that 
must be met: (a) knowledge of the form of the frequency function from 
which a sample of size n is taken and (b) availability of previous informa- 
tion regarding the unknown parameter in the distribution. They suggested 
that a first sample be employed to determine the number of observations 
to be taken in a second sample in order that the two requirements may be 
satisfied. 


Since, at most, only one point within a confidence interval can be the 
parameter, one possible measure of the desirability of a confidence interval 
procedure is the extent to which an interval contains false values as Te 
flected by its expected length, since this length is an indication of the 
"average extent" to which false values are included. Embarking upon ? 
new approach, Pratt (1961) attempted to determine the probability thal 
a confidence interval procedure contains a particular false value through 
integrating over false values. In demonstrating that the expected length o 
a confidence interval is equal to the integra] over false values of the proba- 
bility that each false value is included, he noted that his approach yieldec 
an equivalent measure for choosing among confidence interval procedures 
and presented numerical examples to illustrate his point. In a subsequent 
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article based upon the work of Pratt, Madansky (1962) took a specific 
example, applied two different methods of obtaining a confidence interval 
for a parameter, found two different values for the interval, and showed 
that for one-sided intervals the expected length of a confidence interval 
and the probability of covering false values were not related. 

In other papers, Halperin (1961) discussed confidence intervals in the 
instance of censored samples; Harter (1960b) furnished tables of per- 
centage points for the range, moments of the range, and the studentized 
range; Blyth and Hutchinson (1960) presented a tabulation of the .95 
and .99 confidence intervals of the binomial parameter that were based on 
Neyman’s shortest unbiased estimates; and Pachares (1960) also developed 
extensive tables of confidence limits for the binomial distribution for values 
of .005, .010, .025, and .050 in a. 

Additional References: Birnbaum (1961) ; Harter (1961, 1963) ; Murthy 


(1962): Pachares (1961). 


Inference Concerning Authorship 


In a paper of considerable potential interest to historians and pu 

analysts who may wish to infer the authorship of documents in papets 
Mosteller and Wallace (1963) undertook an investigation im w ich they 
compared discrimination methods, carried out explorations with ss 
approaches in large-scale data analysis, attempted to paper whet t 
Madison or Hamilton wrote the disputed Federalist papers, an ror 
routine ways for solving problems of authorship. They ae t iind 
frequency of appearance of articles, prepositions, and agua ie which 
occur at fairly stable rates, yield important ce gin (e ormation. 
On the other hand, many meaningful nouns related to the b e aped 
at hand do not provide stable rates of frequency per pige oste i an 
Wallace reported odds that were highly in favor of | one having 
written all 12 of the Federalist papers that have been in dispute. Te of 
different sets of prior as poms was considered to be a worthwhile 
pro re i inferential process. R . 
Ser ihe siegt et and university scc € anliy re- 
view committees that are responsible for ps — m 
cerning whether professors should be promote i e i -a 
given increases in salary; the techniques propose 4 € aoa hy 
may be of considerable interest—especially in i ne ee 
Teprints presented by a professor as evidence o ^ i olarly wem 
list two or more authors. These techniques would a m Har ee n with 
a crusading. or crucifying spirit a means for inferring x E her Scd po 
fessor ünder consideration in eed did undertake dene d ig wri ing rà 
pearing in his joint publications or whether his co ^ p y 
Prepared the text. Of course; samples of previously pu ishe: s ings by 
each coauthor would be necessary to furnish a base for comparison. 
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In the event that the reader, who as a coauthor himself may bastir 
gling in the publish-and/or-perish reprint race, is Vei to en 
at the thought of possible exposure, this writer does not wis V to iim 
him unduly, for it is highly unlikely that any senior faculty (t€: si 
member or any administrative officer would dare take precious time 2 
respectively, his own zestful scholarly activities or his challenging ue 
work responsibilities for such an unrewarding inquisition. Then, ah is 
is the probability that either a college administrator or anyone with vw 
statistical competence to do su^h an inferential study would be reading t 


chapter? It would seem safe to say that the probability is low—exceed- 
ingly low indeed (p — .01). 


Significance Tests 


Parametric Versus Nonparametric Tests 


In a highly cogent article, Anderson (1961) 
nonparametric tests both with respect to the 
significance level, power, 
consideration of the pro 
paper was limited to nu 
in his detailed argumen 
cance level and power o 
order data is not great, 
ford the necessary vers 
psychological research 
basic tools in psychol 
of measurement, And 


ric and 
compared parametric anc 


: oup easly 
results furnished by application of a parametric test, argued convincing 
that numerous practi 


ariance, or generality of DON 
far overshadow refinements of scale typology, and pointed out that a 
ferences regarding psychological meaning are actually primarily 5" 
stantive and judgme: 


H i ng 
ntal in focus rather than fermal in the sense of being 
related to the reliability of data. 


st 
Robustness of the t-Test and the Wileoxon-Mann-Whitney U-Te® 


In terms of both a rev 
of samples from populat 
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Boneau (1962) made a comparison of the power of the t-test against the 
power of the frequently used nonparametric Wilcoxon-Mann-Whitney 
U-test. For both one-tail and two-tail tests, various combinations of sam- 
ple size and of population variances were taken systematically along with 
prescribed differences between population means being present. In gen- 
eral, when the variances of the two populations were homogeneous there 
was a remarkable degree of closeness in the empirically derived power 
curves of the t- and U-tests relative to samples of sizes 5 and 5 and of 15 
and 15. In the instance of samples. one of size 5 and one of size 15, the 
degree of resemblance was still close, although the power of the t-test was 
slightly superior to that of the U-test. For the situation of normal distri- 
butions, of heterogenous variances in the two populations (the ratio of 
the variances being 4 to 1), and of samples of equal size, the power of 
both the ¢. and U-tests remained relatively high, although that of the 
t-test was slightly greater than that of the U-test. 
When the size of one sample was three times larger than the other, and 
when the corresponding variance in the population from which the first 
sample was taken was also the larger of the two population variances, 
there was a substantial loss in power for both the U- and t-tests, although 
the power of U was affected less than was that of t. The empirical values 
for the significance levels a were reduced below the intended, or nominal, 
significance levels of .05 and .01—the departures being greater for ¢ 
than for U. When the size of one sample was three times larger em et 
other, but the corresponding variance in the population iae rs 
sample was chosen was the smaller of the two population variances, 


eflects were in the opposite direction. a. both U ea doy er 
val ; han the intended, or nominal, : 
alues for a were larger tha am ilm for U. Moreover, the power 


.0l—the discrepancies being larger [ 
of t was airo than that of U. These results held el in sre e 
two-tail tests, although the two-tail tests did show slightly sm 


crepancies in distance between empirical power functions D ef en P than 
did the one-tail tests. In short, when sampl aa el distor ba 
in the two populations were heterogeneous; c : ; j 
empirical van levels and in the degree of empirical papam piis 
relatively greater for t than - v for a; one-tail an sts, 
although both tests were affected adverse y- . 
In the instance of sampling from nonnormal Lee prp 
and exponential), it was noted that when the two dd utio " uds 
Same shape, the U-test appeared to be consistently slightly ssn ae = 
than the t-test for small differences Letween population mea eel 
for larger differences. The shapes of the power priam — ied 
Similar. When one distribution was normal and the other was 4s t€ 
oneau found that the empirically derived values ue 2 pic 8 A 
slightly closer to the nominal levels of a for U than or ! = — Xs 
oth one-tail and two-tail tests and that the empirical pow [wie cu E 
in general were only slightly higher than those for t. en the sizes 
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of samples from two populations of different shape were increased, the 
slight advantage, if any, which U held over t disappeared. The author 
also suggested that, in general, empirically derived power functions for 
the t-test are probably much closer to the theoretical power function than 
are the seemingly more variable empirical curves for the U-test. 
Considering his data, Boneau took a moderate view concerning whether 
the t-test or the U-test should be employed when assumptions underlying 
the ¢test are violated, for both tests are subject to a certain amount of 
error. However, he recommended that one not, desist from using the 
U-test in place of the t-test simply because it is less powerful than the /-test. 
Other articles were concerned with the robustness of the t-test compared 
with that of the Wilcoxon statistic. Van der Vaart (1961) reviewed several 
studies and noted that when the variance of one population is zero—an 
extreme case—the relationship between sample sizes serves to determine 
which one of the two tests is to be preferred. In a comprehensive paper. 
Wetherill (1960) concluded that if two distributions are nearly normal 
but differ in shape, the Wilcoxon test, although sometimes slightly better 
than the t-test, is frequently inferior to it. Although some of his findings 
were similar to those noted by Boneau, Wetherill emphasized the com- 
plexity of the power function of the Wilcoxon-Mann-Whitney U statistic. 
which itself is highly dependent upon the mean, skewness, and kurtosis of 
the distribution. In his critical review of the U- and t-tests, Lubin (1962) 
described in detail the complexities of the problem from both the statis- 
tical and the psychological points of view. His cogent discussion is worth 


serious study and careful reflection, although there appears to be a 


possible contradiction in two adjacent : us . 346, 
Il. 2427). Jacent sentences in one paragraph (p. 


Of considerable help to those who make frequent use of the Wilcoxon 


rank-sum lest statistic in the two-sample case, new tables of the exact 
critical values have been developed (Verdooren, 1963). The lower signifi- 
cance levels of -001, .005, -01, .025, .05, and .10 are presented for combi- 


nations of sample size u B in á 
j P to 25, along witl approximations à 
adjustments for ties, i g h two appr 


Miscellaneous Contributions to Problems of 
Robustness, Power, and Efficiency 
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dispersion, which are frequently employed because of their computa- 
tional simplicity and alleged robustness, with standard significance tests 
in order to ascertain the extent of similarity of conclusions regarding 
significance. Among several examples chosen for illustrative purposes, 
including an evaluation of the sign test relative to the t-test for paired 
data, the probability levels for the quick tests were only slightly larger 
than the chosen a levels for the standard tests of significance. Having 
noticed that the Kolmogorov-Smirnov test often has not yielded signifi- 
cant results for samples of moderate size when the chi-square test has 
done so, Durbin (1961) presented empirical results and then offered 
some methods of constructing exact tests of improved sensitivity that 
are intended to detect differences in two distributions. 

In other papers concerned wth the efficiency of tests, Mikulski (1963) 
compared the efficiency of nonparametric procedures in the two-sample 
case relative to parametric ones when the true distribution diverges from 
the assumed one; Chanda (1963) determined the efficiency of nonpara- 
metric tests for discrete distributions against the most powerful tests to he 
found for these distributions when they are members of parametric 
families and suggested that the values of the relative efficiencies may be 
dependent upon the extent of the discreteness of the distribution: Capon 
(1961), in a highly abstract article, studied the asymptotic pc ise 
several “locally most powerful rank tests”; and Sukhatme | (1960) dis. 
cussed the power of several nonparametric tests of dispersion. 


New Tests of Significance 


In the parametric case, two papers! were ere en aes » de 
hypothesis of equal means depending upon — S: x To test the hy- 
normal populations are homogeneous or Deterageneouss al wh les 
pothesis that the means of two normal populations are equa yhen anpe 
of small size are employed, McCullough, Gurland. and Rosenberg ( ) 


1 -olling the size of a test—that is, the 
ro Jo if shes for controlling : 
tae a vgn a. In what they termed a unilateral case, 


Value of significance Jevel . i 

à means iad ud for controlling a when the — oed an 
Population is known a priori to be larger iban n : perpe m mee 
what was called a bilateral case, 1 which Fa P also ee Sub- 
Population variances is available. a a several approaches 
sequently, Gurland and McCullough (1962) oT oh td Aisa tia A cha: 
through which preliminary Duis a ded ; equal. In light of the 
the variances of two populations may be won 25 i ibas ike Do means 
results obtained, one proceeds As the vim ye reliminary test is 
are equal. The choice of level of significance : Fw aee t nr 
anchored to the desired level of stipdücagen © guide one in studying 
tailed tzbles of size and power are furnished to g ying 
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In the nonparametric domain, several papers have been sac aig e 
the development of tests for differences in spread or dispersion. For aa 
independent samples, Siegel and Tukey (1960) developed a nonparame y 
sum of ranks procedure to test whether two samples arise from the m 
population with respect to spread, or variability, in scores. In addition 
inclusion of extensive tables of critical values for combinations of samp 4 
size from 1 to 20 (n,n, <20), large sample procedures were furnishe 
which also allow for a correction for ties. Since the F-test for differences 
in variance is relatively sensitive to departures from normality, this non- 
parametric test, which is entirely distribution-free in terms of customary 
randomization procedures, should be an important tool. Although this 
test is relatively sensitive to differences in dispersion when means or 
medians of the population are equal or nearly equal, its power decreases 
fairly rapidly as the difference between means increases and actually 
reaches zero when the populations do not overlap. 

A related contribution by Ansari and Bradley (1960) was that of a 
rank-sum test for significance of difference in dispersion of two inde- 
pendent samples that arise from populations with continuous cumulative 
distribution functions. In their test, knowledge of the difference between 
medians of the two populations is a requirement. As mentioned previously. 
Sukhatme (1960) undertook a critical evaluation of the power properties 
of a number of nonparametric tests concerned with differences in dis- 


persion of two populations and pointed out several deficiencies of which 
one should be aware. 


In the instance of several samples, Bhapkar (1961) devised a new 
nonparametric procedure to test the hypothesis of equality of location 
(median). He discussed the asymptotic power and asymptotic aie: | 
of his test in relation to other well-known ones that have been proposed: 


Additional References: Adcock (1960); Bhapkar (1961); Chapanis 
(1962). 


Contingency Tables 


Since July 1960, relatively little work appeared that involved the use of 
chi-square functions as compared with the amount of effort devoted durmg 
previous review periods. Much of what was reported regarding the use - 
chi square was in conjunction with the interpretation of data in cd 
tingency tables. In fact, considerable effort was directed toward pr? 
lems of the analysis of data in contingency tables. i 

To assist the research worker confronted with those practical problem 
of interpreting multiway contingency tables which the general arena 
does not answer, Lewis (1962) presented a general review of the ps 
pal methods of analysis along with a selection of computationally ie 2 
procedures, which may be modified for use with different sampliag P^? à 
as well as with theoretical parameters rather than with estimated one 
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Subsequently, in a comprehensive paper concerned with methods of com- 

paring contingency tables, Goodman (1963) showed that the standard 

L error formulas furnished by Lewis as well as by others for Stouffer’s 
method of comparing 2x2 tables fail to yield consistent estimates. 

Through making certain modifications in these formulas, Goodman de- 

" veloped consistent estimators, pointed to substantial differences in the 
j size of standard errors associated with his formulas and with those 
published previously, and offered generalizations of methods of com- 

paring several 2x2 tables. á 

For the situation in which the 2x2 or 3x3 parent distribution is 

bivariate normal, Tallis (1962) furnished a means for the maximum 

! likelihood estimation of correlation from contingency tables. In the 
instance of 2x2 tables in which the marginal totals are not fixed, Edwards 

(1963) showed that a measure of association should logically be func- 

tionally related to the crossratio. In applying a simple method to sub- 

| divide a large contingency table into a series of 2x2 tables, Daly (1962) 
obtained chi-square values for each of these tables—values that he con- 
cluded would furnish a feasible basis for a test of trend in most practical 


situations. 


Two papers appeared that were concerned with errors of classification 


of data placed into cells of tables. Making use of four extreme situa- 


| tions of classification in which two groups were being compared in terms 
es, Rogot (1961) noted that, relative to 


of their respective morbidity rat k ; 
the differences for a set of error-free data, the observed difference in each 
of the error situations will be smaller and concluded that the presence of 
| measurement error will serve to reduce the probability of detection of a 
genuine difference. When the parameter of interest for an alternative 
hypothesis is the difference between two true binominal probabilities 
that occur because of opportunities for misclassification, Walsh (1963) 
evaluated the efficiencies of tests for the circumstance in which the method 
| of classification is subject to random errors and for the situation in which 
no classification errors take place and then concluded that the proposed 

tests are valid for both events. 
Two tem papers were those of Bennett and Hsu (1960), 
he exact test for a 2x2 con- 


who furnished power cortour curves of t 1 : 
rnas tli Lx of Kincaid (1962) who prescribed a method involv- 


ing a chi-square expression for combining a set of contingency tables each 
consisting of two rows and an arbitrary (but fixed) number m of columns. 


Multiple Comparisons 


o the control of error rate in the performance of 
ts within the same experiment, Norton has de- 
III several of the major contributions to be 
an, in the 1961 and 1962 papers by 
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Ryan, and in the 1962 note by Wilson, attention also should be called La 
the critical discussion by Lubin (1962). In addition, four other pue 
were noteworthy. If prior to his investigation an experimenter can limi 
himself to a relatively small number of linear contrasts among a certain 
number of means, Dunn (1961) has demonstrated that the widths of the 
confidence intervals that can he established are shorter than those feun 
by the familiar procedures of Scheffé and Tukey. After describing severa 
factors that influence the number of significant findings in multiple sig- 
nificance tests, Block (1960) proposed a general method which he main- 
tained allows for an unbiased inference of a group of findings considered 
as a set and then suggested as well as illustrated a high-speed computer- 
based approach to be used in the generation of an empirical sampling 
distribution appropriate to a particular data matrix. Harter (1960a) 
furnished improved estimates of critical values for the now well-known 
and popular Duncan’s New Multiple Range Test, and Steel (1961) gen- 
eralized Wilcoxon’s rank-sum test in order that multiple comparisons foi 


each of several random samples of equal size given experimental treat- 
ments could be made against a control group. 


Concluding Remarks 


As a participant in the preparation of the chapters upon statistical 
methodology for the December 1957 and December 1960 issues of the 
Review, this writer has noted what appear to be a few changes ^ 
emphasis in the statistical literature from July 1, 1954, to July 1, 1963. 
During the period from 1951 to 1957, considerable stress was place 
upon development of nonparametric methods, perhaps at the expense as 
effort being directed toward augmenting classical parametric theory. Fron 
1957 to 1960, the amount of attention given to nonparametric statistics 


: : ra- 
appeared to decrease, and renewed interest in the development of par? 
metric procedures was evident. 


The most recent three-year period, 
consolidation and concili 
a unified view of statistic 
by the general and inclus 
1962), Pearson (1962), 
focalized but relatively c 
Fraser (1961), and S 


which seems to have been one of 
alion, has been marked by attempts to nm 
al inference and statistical analysis, as aer 
ive expository presentations of Birnbaum e al 
and Tukey (1962) as well as by the somet 
omprehensive contributions of Brillinger (196 zal 
avage and others (1962), Evidence of a pus 
rapprochement between nonparametric and parametric methods was QA 
parent in the papers by Anderson (1961) and Boneau (1962) and - 
others in which parametric and nonparametric procedures were yu 
pared. Attention in several articles has been given to problems. of es 
comparative power and efficiency of nonparametric and parametric t€? - 
The most noteworthy new direction is that furnished by Bayesian statistic? 


1 
the application of which affords an exciting and promising approach ir 
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research both to the design of educational and psychological investiga- 
tions and to the analysis and evaluation of the forthcoming data. 
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CHAPTER III 


Developments in Analysis of Variance and 
Design of Experiments 


DEE W. NORTON 


A 4 » ica- 

cern is the development and Interpretation of research methods se 
Sitar s 2. 3 i 

ble to studies in education and the social sciences, The second grou[ 
1s comprised of those who are planning and analyzing substantive r 


the following paragraphs. In doing so, the content researchers will ies | 
find it necessary to seek assistance from a member of the first group o 
readers, . "T" 

Contrary to the practice of Stanley and of Collier and Meyer in p 
Coverage of design and analysis in the December 1957 Review and in 
December 1960 Review, respectively, this chapter does not necessarily 
omit references cited in the chapters on Statistics in the Annual Review of 
Psychology. In 1960, the Annual Review changed to an alternate-yeat 
coverage of the broad topic of statistical theory; hence, the overlap cae 
this chapter and Lubin's (1962) contribution is not pronounced. an 
such overlap as has occurred does not seem undesirable. 


Philosophical Basis for Inference from 
Experimental Studies 


d 


Ln 
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would lead to different conclusions from sequential or nonsequential ex- 
periments is inconsistent with the likelihood principle. Appearances to the 
contrary, Birnbaum indicated that he was interested in an approach to 
informative inference that did not depend on the Bayesian principle of 
inverse probability. 

Birnbaum stated at one point that his findings would have radical con- 
sequences for everyday practice. Yet he later seemed to imply that con- 
ventional procedures would be adequate if thorough consideration were 
given in the planning of an experiment tà control both types of error 
for a specified type and degree of falsity of the hypothesis under test. 

The issues explored quite formally by Birnbaum had earlier been 
touched by Rozeboom (1960). Rozeboom criticized the widespread obei- 
sance to .05 significance levels and urged greater use of the likelihood 
function, confidence intervals, and inverse probabilities. He did not, in 
this reviewer's opinion, give adequate consideration to those situations in 
which an experiment is carefully planned with formal attention to power 


and sensitivity. . — 
In a somewhat different vein, Tukey (1962) reviewed at length the 
hniques. He empha- 


current status of analysis of variance and similar tec l e er 
sized the need for (a) more efficient techniques to deal with “wild” data, 
(b) further investigation of the potentialities of half-normal plotting to 


identify error sources, and (c) additional work on the transformation 


problem. , 
Additional References: Albert (1961); Neyman and Scott (1960); 


Pearson (1962). 


Mixed Model—Repeated Measurement Designs 


One Fixed Factor—Repeated Measures on the Random Factor 
i hwhile analysis of the difficulties in- 
Fus List) preni ey subject undergoes each of a fixed set 


volved with designs in which eac j i 
of treatments He presented expected mean squares for cases in which order 


j (b) randomized for 
of administration is (a) constant for all subjects, n or 
each niim and (c) counterbalanced with red ed "cn 
His emphasis on the fact that couñterbalano ng ica cii. 5 
ear. was well placed, 5 m T rm aul se unos effects 
e simple order beron es aepo did not pitis an entirely 
appear as treatment-order por". jase hts in the F-test of the 
ci erene bt. T40. If there were no treatment effect, it 
pans d be order-subject 


lá l here woul interaction effects paein 
r s ye ce an appareni 
cach imoka e these effects would consequently produ pp 


interaction. S e i i ems reason- 
ti If uch an argument 1S extended, it se 
er-group int 


Tr T as are in the 
to expec ] the same sources of variation as 
tin MSrz0 4 


REVIEW or EDUCATIONAL RESEARCH Vol. XXXIII, No. 5 


within-subjects residual term. This argument is important, since ifa 
justifiable test of treatment-order interaction is available, an experimenter 
can determine whether the attempt to increase precision by repeated ob- 
servation under counterbalanced orders has been successful. If a sequence 
effect is identified, it is only necessary, as Gaito clearly pointed out, to 
resort to the simple randomized analysis of the data observed on the 
first administration only. . 
Imhof (1962) questioned the validity of the usual test of no main 
effect of treatments in treatments-by-subjects design. As Box and other 
writers have pointed out in the past few years, Imhof noted that the ratio 
MSr/MSrg does not (except for the two-treatment case) follow the F 
distribution with (t-1) and (t-1) (s-1) degrees-of-freedom unless the 
within- and between-treatment dispersion matrix has diagonal elements o” 
and off-diagonal elements po. Imhof went on to show that for small 
degrees-of-freedom the actual risk of a Type I error in a nominal .01-level 
test of the hypothesis of equal treatment-population means ranged from 
ern of disparity among the nonzero char- 
acteristic roots of the dispersion matrix. He also provided tables for 


Two Fixed Factors— Repeated Measures on a Random Factor 
Under One of the Fixed Factors 


There is no brief title 
type considered here, In so 


and McNee (1960) provided an ex 
ing this design which will } 
workers. The authors showed 
of variances and equality of 
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assumptions. They also suggested an analysis for testing for parallelism 
of linear trends among the b treatment groups, but failed to emphasize 
that this test involves the implicit assumption that a linear trend is an 
adequate representation of the relationship in each b condition. 


Complex Mixed Designs Involving Repeated Measurements 


Imhof (1960) explored a type of design which seems likely to have 


considerable applicability to studies in education and the behavioral 
sciences but which has received little attention in the research literature. 
The design involves one fixed factor and two crossed random factors: e.g., 
each of a random sample of s subjects responds to each of a random 
sample of i stimuli under each of ¢ fixed treatments. Imhof showed that 
despite appearances from the table of expected mean squares, the ratio of 
the mean square for interaction between the fixed factor and one random 
factor to the mean square for triple interaction is distributed as F only 
when the conditions specified for a one-fixed—one-random factor design 
are satisfied in three distinct senses. Furthermore, he noted that no 
straightforward test is generally available for the main effect of the fixed 
factor. 


Stanley (1961b) discussed a similar type of d m : 
reliability study involving subjects, raters, and traits. He pointed out the 


implications for estimation and tests of significance of various | canes 
about treating the several factors as fixed or random and showe ; p to 
secure estimates of selected reliability coefficients. Ina later article, tan- 
ley (1961a) exhibited an analysis of a very complex design with de 
random factors (subjects and raters) and multiple measures per Su ject- 
rater combination. In addition, the raters and the subjects were each sub- 
divided on classificatory bases. Thus, within each type-of-rater by meee 
subject cell, the design was of the sort considered in his earlier article. 
Stanley showed how to form synthetic error terms for approximate tests 


f th i ts of the classificatory factors. 
i jir coda Graybill and Hultquist (1961) ; Harter (1961) ; 


Thompson (1962). 


esign as applied to a 


Multifactor Designs Without Repeated Measurements 


Randomized Block Designs 
rtant contribution to the extensive 
hich a stratifying factor is introduced to increase 
ht otherwise be a completely randomized one- 


factor design. He gave special attention to the effect of treatment-subject 
interaction of expected mean squares and choice of error terms in de- 
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Collier (1961) made a very impo 
literature on designs in W! 
the precision of what mg 
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signs in which each of the several blocks is comprised of subjects pre: 
sumably homogeneous on some variable related to the criterion measure. 
Within each block (level), random subgroups of subjects are assigned to 
each treatment. Making use of finite randomization models, Collier 
emphasized the need to conceptualize a mean score for each subject over 
all treatments, even though each subject is in fact measured only under 
the particular treatment resulting from random assignment. He showed 
that in terms of expected mean squares, proper error terms exist for the 
important case of fixed treatments, fixed blocks, and random subjects. 
In view of the stringent assumptions for exact F distributions in repeated 
measurement designs in which treatment-subject interaction effects are 


y of blocking in incom- 
less than all treatments occur 
yet been widely used in edu- 
cational and psychological research, probably because they inevitably 
; nce, assumptions about non- 
n industrial and agricultural 
tential applicability to investigatory studies in 

gns should receive more atten- 


Additional References: Addelman (1963) ; Potthoff (1963). 


rter (1962) presented a model which em 
corresponding to any point in the facto 
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combinations at random from the set of all possible combinations. Wilk, 
Gnanadesikan, and Freeny (1963) developed a method of estimating 
error variance in multifactor designs from the M smallest squared single 
degree-of-freedom contrasts out of K such contrasts available. In limited 
empirical checks with M/K ratios from 0.03 to 1.0 and K's from 10 to 
200, they found their estimate to be generally less than 5 percent too 
large. Their technique may prove quite useful in designs in which there 
is no effect assuredly free of treatment effects. 
Additional Reference: Ehrenfeld and Zacks (1961). 


Latin Square Designs 


Sheehe and Bross (1961) presented an interesting method for con- 
structing Latin square designs which are balanced with respect to (a) 
the number of treatments preceding every treatment, (b) the set of all 
predecessor treatments, and (c) the set of immediate predecessors. They 
did not, however, recommend any specific analysis of their designs which 
might take adequate account of the balancing. Ogilvie (1963) proposed 
a technique for removing time trend effects in a one-factor = 
measurement design employing balanced sets of Latin squares the 
method involves the tenuous assumption of no treatment-subject interac- 
tion; hence, as Gaito (1961) and Collier (1961) pointed out, it is likely 


to be of limited applicability. 


Special Topics in Design and Aualysis 


Multiple Comparisons and Separation of Populations 


; > yate. Í aki ti of 
The problem of controlling error rate in making mul iple ten : 
same experiment is not yet satisfactorily resolved, 
d Bayesian analysis of the two-decision and three- 
(EARS £6 ose a 
Duncan (1961) developed a “minimum-ayerage- 
ule for all possible two-treatment comparisons 
actor completely randomized experiment. His 
assumed constant for all 


significance within the 
Starting with a detaile 
decision t-test problems, 
risk" multiple decision r 
among n means in a one-f. 


; espe f (a) a loss-ratio 
rule requires specification of ta : e ro-tr 
antenas Frectie the relative seriousness of two-treatment Type I 
H . o 


and Type II errors, (b) the variance of the (presumably normal) a ida 

distxibaHon of the difference between population means, and (c) additive 

losses. In testing his rule, Duncan -found that the critical difference for 
ses. g 


^ i ent of the total number of means 
comparing means was almost independent ced ee 
= at was due largely, he emphasized, to the 


i ved—a ci nce th ] 
s of de Duns His findings will be heartening “ those ee 
feel as does this reviewer that most multiple ei per gni ures à a 
excessive concern with the risk of an n-treatment Type : error in spite : 
the fact that two-treatment differences are commonly of primary interest. 
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Wilson (1962) made a good case for setting error rates A id 
hypothesis basis, as has been the common though usually toii i" I: 
practice. He also emphasized the complete interdependence among i 
per hypothesis, per experiment, and experimentwise error rates = 
hence, the impossibility of choosing among them on purely logical grounds. 
Ryan (1962) reiterated his preference for setting rates for the entire 
experiment, but admitted the complexity involved in attempting to balance 
the risks of the two types of error with respect to the over-all hypothesis 
and the many embedded hypotheses. In an earlier article, Ryan (1960) 
presented his generalized method for making pairwise comparisons on 
statistics other than the mean. 

In order to keep an upper limit, say a, on the experimentwise error 
rate, the largest difference among n groups was tested at a nominal 
2a/n(n-1) level. If significant, then the two next-largest differences were 
tested at a nominal 2a/n(n-2) level. This relaxing of significance levels 
for tests made within a significant over-all test occurs in several multiple 
comparison procedures, For example, this relaxation takes place when 
a significant F-test is followed with t-tests in place of the use of the 
mathematically more appropriate Scheffé S-test, 

Lehmann (1961) considered the problem of selecting in a one-way 
design those treatments which produce a higher me 
associated with a standard treatment or with a stan 
menter. He not only indicated w 
general be explicitly included in th 
fairly simple procedures that min 
positive classifications. 


Additional References: 
(1962). 


an response than that 
dard set by the experi- 
hy the standard treatment should in 
1e experiment but also developed some 
imize the expected proportion of false- 


Bartholomew (1961); Fabian (1962) ; Hogg 


Additivity 


Additivity of effects is 
appropriate model for ma: 
suggested a modification of Tukey’ 


m Tow to row. He did not, however, make 
differences in slope are directly 
(1961) proposed a method for 
e Tukeyian sort can be e a 
sformation. Lubin (1961) offere 
cations of nonadditivity for inter- 
torial designs. He did not, hoyero 
dditivity that affects interpretation 
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in completely randomized designs and the nonadditivity that bears on the 
validity of F-tests in repeated measurement designs. 


Covariance Analysis 


ertaining to covariance was presented by 


An ingenious development p 
ed how one may improve on 


Robson and Atkinson (1960). They show 
the power of the usual test for homogeneity of linear regression by restrict- 
ing the alternatives against which this Hypothesis is tested. For a three- 
group situation they specified the following alternatives: (a) the three 
regression lines intersect at a common point with slopes proportional to 
intercept, and (b) the three regression lines intersect at different points 
with slopes a quadratic function of intercept. The among-regression- 
coefficients sum-of-squares was partitioned into two one-degree-of-freedom 
orthogonal components for regression on adjusted intercepts, and each 


component was tested against the pooled within-group deviation from 


individual regressions. 
Additional References: Cox (1962) ; Curnow and Sharpe (1962). 


Multivariate Analysis of Variance 

m of how to deal with 
ained on each experi- 
and Hughes (1962) 
variate analyses takes 


in the proble 


has been shown 
easures are obt 


Increasing interest } 
ple criterion m 


designs in which multi i 
mental unit (e.g. subject). Smith, Gnanadesikan, 
ions the use of separate uni 


noted that in such situati " 
no account of the correlations among the response measures. They com- 


mented that a high over-all risk of Type I error may result. However, this 
criticism seems a bit casual in view of the difficulty of specifying precisely 
what hypotheses are of interest. These authors illustrated this very point 
by exhibiting three distinct (and complex) criteria of equality among 


treatment effects and by noting the arbitrary nature of the choice among 
power of the classical T?- and W-tests 


them. Ito (1962) compared the : 
which are cite for dealing with multiple response measures. He tabled 
Jected classes of alternatives 


i y inst se 
approximate powers of.the two tests against se, 
for an 05 significance level and found little difference for samples size of 


100 to 200. 


Additional Re, (1960). 


ference: Bhapkar 


Assumptions in 'dwo-Group Designs 


senberg (1960) made an extensive investi- 
about differences between 


gation of ways to ™ : 
population wi er^ in testing the hypotheses of equal means in a two- 
group simple-randomized design. For the case 1n which one variance 1s 
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known to be greater than the other, their proposed statistic compared 
favorably in power with the modified t-test appropriate when the variano à 
ratio is known. For the case in which the variance ratio is completely 
unknown, the authors found it difficult to devise a statistic which would 
maintain a reasonably constant risk of Type I error. Gurland and McCul- 


real promise. 


Concluding Remarks 


A truly significant contribution to the 
and analysis was made by Winer (1962) 
designs with remarkable thoroughness, 
model underlying each design as well as the conditions that must be 
satisfied for valid statistical tests. His treatment of repeated measurement 
designs was particularly outstanding. Numerical 
were provided for most designs, 


growing list of books on design 
- He covered a great variety of 
He emphasized the structural 


illustrations or examples 
but interpretive discussion seemed rather 
limited in some instances. Winer also devoted more attention to complex 
incomplete designs than seems warranted by current needs in educational 
and psychological research. Edwards (1960) brought out a revision of his 
well-known text in design, with greater emphasis on individual compari- 
Sons, on trend analysis. and on graphical interpretation of interactions. 
Campbell and Stanley (1963) save valuable specific criticisms of a variety 
of designs used in research on teaching, Graybill (1961) produced a 
highly technical but much-needed integration of analysis of variance and 
covariance techniques with a more generalized approach to the study of 
relationships, 
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CHAPTER IV 


Regression and Correlation 


WILLIAM J, MOONAN and JOHN H. WOLFE 


o 


Ti CHAPTER reviews publications relating to regression and correla- 
tion theory that have appeared from June 1960 to June 1963. These sub- 
jects were last reviewed in the Review by Michael and Hunka in Decem- 
ber 1960. 


Regression Analysis 


Regression Models 


A large class of models descriptive of natural and behavioral phenomena 
can be characterized by an equation of the form y=F[x.8]+e, where y 
is the dependent variable of particular interest, F[x.8] is a function of 
some variables x and parameters £, and e is an error component. The 
model can be particularized in a bewildering number of ways by specify- 
ing various kinds of statistical properties of y, x, and e, and forms for 
F[x,8]. For example, y and x may be random or fixed variables, and 
F[x,8] may be a linear or nonlinear function (in the „usual algebraic 
sense) of the 8. If the F[x,8] is B-nonlinear, then statistical literature 
terms the model nonlinear, whereas if F[x,8] is x-nonlinear, the model is 
called curvilinear, Semantically, this distinction is not too clear, and it 
consequently has led to confusion. Nevertheless, it is very important. 

The equation y— ixi +82%2+e is both B-linear and x-linear; whereas 
Y — Bi, + Bore +Bsxite +e is B-linear and x-nonlinear because of the term 
Xx. The equation y— fixi t (81) (B2)x2+e is B-nonlinear and x-linear, 
but y Byxi?- (85) (Bs) xs - € is B-nonlinear and x-nonlinear. We have 
made these distinctions at some length because we find this classification 
system convenient for part of this review. If y — F[x.8] +e is looked at as a 
Prediction model, it is usual to assume that y, x, and e are random variables 
and that F[x,8] is B-linear. Also, in this case, if a particular set of values 
for x— X is given with X and e uncorrelated, and if the elements of e 
have expected values of zero and if they are independently distributed with 
Constant common variance, then the model is known as one of multiple 
regression. 

Particular care needs to be taken to select a proper regression model. 
Guidance in this matter is difficult to obtain; experience and professional 
training are the best aids. A much-needed general discussion of model 
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building and of methods of defining, solving, and testing models was 
presented by Ackoff (1962). Written from the operational analysis point 
of view, his book contained much useful information about models. Box 
and Hunter (1962) proposed a technique for building models bv a 
unique experimental procedure that statistically analyzes estimated re- 
gression parameters in order to develop, iteratively, better models. A more 
detailed exposition of the problems associated with estimation and tests 
of hypotheses of regression and with analysis of variance models was 
contributed by Graybill (1961). His book is an excellent introduction to 
linear statistical analysis. Graybill also analyzed the so-called functional 
model which relates the mathematical variables Y and X. For this case. 
F[X,8] should be X-linear. A very complete exposition of the general 
theory of regression analysis and of functional model analysis was con- 
tributed by Kendall and Stuart (1961). Plackett’s (1960) book gave a 
broad interpretation of regression analysis by defining it as the domain 
associated with the algebraic theory and with the numerical methods of 
least squares. Although mathematical, Plackett’s work does provide 


numerous illustrations of computational procedures suitable for desk 
calculators and computers. 


B-Nonlinear Models 


If F[x,8] is B-nonlinear, estimation problems are generally difficult, 
mathematically and computationally. A very general B-nonlinear model 
is the generalized logistic equation defined by Nelder (1961). This model 
has the exponential curves, the Gompertz equation, the logistic equation. 
and the Mitscherlich equation as special cases. These forms are associated 
with regression functions that have an upper and/or lower asymptote. 
Growth, learning, and many other educational processes have this limiting 
property. Therefore, these forms, which are very important, have general 
utility. It is known that x-nonlinear regression analysis is usually not ap- 
propriate for data which have single or double asymptotes. Stevens (1951) 
had previously presented computational procedures using a Taylor expan- 
sion for the Mitscherlich equation and its transforms, This computationa 
work was also generalized by Nelder. Turner, Monroe, and Lucas (1961) 
also gave estimating procedures for a general class of asymptotic models 
and considered their application to the path analysis of complex causal 
networks of variables. A modification of the Gauss-Newton iterative pro- 


cedure for solving simultaneous equations was applied and illustrated for 
B-nonlinear models by Hartley (1961) 


x-Nonlinear Models 


The general value of x-nonlinear forms for regression equetions is 
beginning to be recognized in applied educational research. Applications 
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have been prompted in part by the increased facility that computers have 
provided for making regression analyses. The x-nonlinear forms may 
involve many kinds and mixtures of functions of a basic set of independent 
variables. Power functions and crossproducts of variables have proved 
useful as predictors in research studies. Anderson (1962) investigated the 
problem of determining what polynomial degree should be included in 
a regression equation. He provided a procedure for ascertaining whether 
a higher power than that chosen is appropriate. Ward (1962) has been 
a resourceful advocate of x-nonlinear analysis. He emphasized that binary 
variables and such x-nonlinear forms as reciprocals, trigonometric func- 
tions, and ratios, which may be useful as potential predictors, are fre- 
quently neglected. In addition, he gave a general presentation of regression 
analysis viewed essentially from the general linear hypothesis standpoint. 
Bottenberg and Ward (1963) elaborated this concept and gave a refresh- 
ingly novel approach to the technique of research problem formulation. 


Accretion and Deletion Procedures 


From a few original variables, computers permit the generation of a 
large family of variables as predictors, although the ability to perform 
the computations does not mean that they should be performed, A 
researcher should have good theoretical or practical reasons for introducing 
new or transformed variables into his regression analysis. When this is the 
case, it is sometimes necessary to engage in some sort of variable selection 
program. Although many techniques are available for this process, most 
are variants of the so-called deletion or accretion procedures used by 
Horst and MacEwan (1960). They applied these methods to problems of 
multiple absolute and multiple differential prediction and indicated that 
the deletion procedure is better than the accretion since, at any stage, il 
utilizes to a greater degree the intercorrelations among the predictor 
variables. 

Larson and Bancroft (1963) contributed an important paper in the 
study of accretion and deletion procedures. Among other things, they 
characterized the two procedures in terms of hypothesis testing and showed 
that under certain conditions, the deletion procedure is superior if the 
predictions made involve the ‘same predictor scores. Nordg (1960) pro- 
vided a description of a computer process for a generalized as well as 
for the simple deletion procedure. 

Wolfe, one of the authors of this chapter, has studied polynomial regres- 
sion and correlations and has found that the order of the accretion of 
polynomial variables into regression equations depends upon the scaling 
of the components of the polynomials. He recommends that the compo- 
nents of polynomial variables be mean-deviated. This method prevents 
polynomials, which contribute little more than their individual component 
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variables, from entering the regression equation prematurely. This action 
has the additional merit of avoiding some ill-conditioning of the x-variance- 
covariance matrix. In general, one should be cautious about including 
polynomials of very high degree in a regression equation. Such terms may 
add relatively large contributions to prediction estimates, and unless the 
corresponding regression coefficients are very accurately estimated, large 
errors can be introduced into the predictions. In such circumstances, cross- 
validation procedures are especially important. 

The problem of shrinkage of the multiple correlation coefficient in cross- 
validation studies was investigated by Nicholson (1960). He defined an 
index of the measure of efficiency of prediction equations and gave its 
distribution function. The confidence interval of this index indicates the 
degree to which a regression equation derived from an original sample 
will predict in a crossvalidation sample. He also discussed the index in 
terms of relative sample sizes and predictors. 

A practical discussion of problems associated with the accretion and 
deletion procedures was furnished by Moore (1962). He pointed out that 
(a) at any stage of the accretion procedure. 
may appear before a significant variable; (1 
bination of significant predictors may not be discovered; and (c) the 
application of both procedures to the sai 


me data will not necessarily yield 
the same set of predictors. Moore suggested that the choice of the accre- 
tion or deletion method ma 


y depend upon the number of predictors 
desired to be retained in the final regression equation. If K is the number 
to be retained and V is the number available, he suggested that the accre- 
tion procedure should be used if K is very much less than V. Alternatively. 
if K is only slightly less than V, the deletion procedure should be used. 
Although this seems to be good advice from a computational standpoint. 
the method does not obviate the difficulties arising from a complex rela- 
tionship among the variables, There seems to be no method presently 
available that minimizes computation and guarantees the optimum selec- 
tion of predictors, Perhaps there never will be. Although the accretion 


and deletion methods seem to be reasonably efficient, they are subject to 
dangers with which sound research and professional judgment will have 
to cope. 


Freund, Vail, and Clunies-Ross (1961 
and Goldberger (1961) discussed the estimations and significance test 


resulting from a two-stage stepwise least-squares accretion procedure. In 


the first stage of this procedure, a f-linear model is analyzed to determine 
the usual regression coefficients for the first set of predictors, e.g., xi. In 


the second stage, the residuals, ¥—B.x,=y,, are obtained; »nd a second 
regression equation is derived using a second set, x», as predictors, Bz 85 
coefficients, and y;, the residuals, as criterion, The authors showed that, 
compared to the usual one-stage analysis of y=f,x,+Box2+e, the esti- 
mated parameters 8, and B, and the tests of significance are both biased. 
and they described the degree and conditions of the biases. 
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Computer Analysis 


Many computer programs are available for performing regression 
analyses for both 8-linear and £-nonlinear models. A surprising number 
of these programs are redundant. performing the same work, or small 
variations of it, on the same computers. A good computer regression 
program will provide for flexible input. an efficient matrix inverter and 
flexible output to meet a great variety of demands. It is especially impor- 
tant for the programs to allow for the generation of x-nonlinear variables 
from a set of fundamental x variables. These problems were ably dis- 
cussed by Healy (1963). Experience has shown, however, that the most 
general programs are not necessarily the best to use because of the details 
required to set up particular problems. A good computer library will include 
a variety of specific regression programs as well as a very general program. 

A special program for handling missing x data seems essential. An 
approach to this problem was suggested by both Buck (1960) and Walsh 
(1961). Their methods involve the estimation of missing values of x by an 
independent regression analysis. Since this technique may necessitate a 
large computational effort, it seems appropriate only if the missing x can 
be estimated very accurately. Although the resulting x-covariances are 
unbiased, the x-variances need to be corrected. A simpler procedure is to 
compute the regression equation from a correlation matrix of all avail- 
able data and to use the largest sample-size estimate of the means and 
standard deviations of x. The biases associated with this procedure, which 
are not known. are related to the factors causing missing x. 


Miscellaneous Topics 


Within the space allotted for this chapter, it is impossible to cover 
adequately the many important topics associated with regression theory 
and practice. The number of references in the current regression literature 
is very large, and such important topics as correlated errors, weighted 
regression, and applications to applied science have been neglected. The 
following sample of references in this section points to the wealth of 
theoretical and practical information available to interested researchers. 

The subject of time series is a fundamental one in the educational 
process. This fact was pointed out by Brown and Meyer (1961), who 
characterized a simple learning model in terms of an exponential smooth- 
ing process. Their work considers the analysis of a smoothing model for 
Which x may involve polynomials of arbitrary degree. The smoothing 
process permits forecasts to be made and their errors to be assessed. 

Bottenberg and Christal (1961) made an application of regression 
technique to criterion cluster analysis. They devised a calculation procedure 
Which successively clusters a set of criteria so that, at any stage, there is 
a minimal loss of predictive efficiency of the clustered criteria by a set of 
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predictors. Such a procedure, which is of great importance for military 
and academic assignment problems, is a clever application of a deletion 
process which reduces the number of predictors in sets of variables rather 
than one at a time. 

Rao (1962) developed an important procedure in the theory of regres- 
sion to apply to selection problems. He reported the theoretical develop- 
ment and the calculation method for constructing a linear function of x 
which will maximize the correlation with a criterion y so that the same 
linear functions correlated with other y's will be non-negative. With this 
methodology, an equation can be obtained which will insure optimum 
selection with respect to the mean of Yı under the condi 
of other criteria will not diminish. 
computationally simple. 

Box and Watson (1962) inv 
of predictor variables upon 


analysis to non-normality of the criterion variable. They found that the 


tion that the means 
For two criteria. the procedure is 


-wide, the list is likely to contain 
eaders. Although the bibliogra- 
it does contain many important 


Correlation Analysis 


Measures of Association 


One of the outstanding articles in the past three years was by Goodman 
and Kruskal (1963), the third Paper of a series concerned with measures 
of association for contingency tables, They presented the sampling theory, 
maximum-likelihood estimators, and variances for their indexes Aas Avy Ay Ys 
and others. The coefficient àa, Which Goodman and Kruskal credited to 
Guttman, might be described as the decreas 
dicting a from b, divided by th 
information about b. A, is defi 


ures Aq and A, lies in their asymmetry; since Asép, the prediction of a 
noa b may be better than the Prediction of b from as The coefficient A is a 
symmetrized combination of A, and àb» and y is a measure of association 
appropriate for ordered categories, closely related to Kendalr's tau and 
equal to Yule’s Q in the 2x2 case, 

Another type of asymmetry of association arises from so-called impli- 


cative relations. For example, Success in college requires above-average 


intelligence, but high intelligence is not sufficient for scholastic achieve- 
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ment. Otherwise stated, prediction is not equally accurate at both ends of 
the predictor, and there is heteroscedasticity of regression. Maruyama 
(1962) discussed the problem of discovering and describing implicative 
relations and made a computational comparison of 20 indexes of associa- 
tion for 15 different 2x2 contingency tables representing different degrees 
of implication. Among the measures compared were chi square, Yule's Q 
and Y, Goodman-Kruskal's y and A, and 13 new indexes proposed by 
Maruyama. 


LI 
a 


Measures of Correlation 


Several papers evaluated the performance of various rank correlation 
coefficients, such as Spearman’s correlation of ranks, rą; Kendall’s tau. 
Tk; and the Fisher-Yates coefficient, r; (the product-moment correlation of 
the expected normal deviates corresponding to the ranks). Fieller and 
Pearson (1961) reported that the Fisher z-transforms of rs, rx, and ry are 
all nearly normally distributed. but that the Fisher-Yates coeficient has 
less sample variance and more discriminating power than the other coeffi- 
cients. Farlie (1960) presented formulas for the mean values and asymp- 
totic relative efficiencies of Spearman's rank correlation and Kendall's tau 
lor a very general class of continuous bivariate distributions. 

Several generalizations of the point-biserial correlation coefficient have 
been proposed. Olkin and Tate (1961) considered the multiple correlation 
coefficient, the partial correlation coefficients, and the canonical correla- 
tions for the case of several dichotomous variables with a multinomial 
distribution mixed with several continuous variables with conditional mul- 
livariate normal distributions. Das Gupta (1960) defined the point-multi- 
Serial coefficient for the case of a single continuous variable having different 
conditional normal distributions for different values of a many-valued 
discrete variable. 

Effects of selection, errors of scaling, and errors of measurement were 
discussed by Carroll (1961). He showed that when two variables have 
differently shaped marginal distributions, the maximum possible correla- 
tion between them will .be less than one; he also illustrated a general 
method for finding the maximum possible correlation. Carroll noted that 
errors of scaling and errors of selection can distort the marginal distribu- 
lions in similar ways, but that the proper adjustments for these two effects 
may be precisely opposite to each other, Graphs giving the product-moment 
Correlation corrected for restriction in range were presented by Michael 
and others (1962). 
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CHAPTER V 


Factor Analytic Methodology 


PHILIP R. MERRIFIELD and NORMAN CLIFF 


Sikes 1960, increased attention to epistemological questions has charac- 
terized the course of research into factor analytic methods. While interest 
in the continuing problems of communality, rank, and rotational options 
has stimulated a number of investigations, the major acceleration empha- 
sized both the design of studies and techniques of factoring in other than 
the common-factor space underlying matrices of correlations between 
variables (R-technique). The definitive text by Harman (1960) lucidly 
presented the matrix algebra necessary to understand the common tech- 
niques of factor extraction. Examples of several rotational procedures are 
presented and compared. 

In this chapter, the three major sections are concerned with (a) design 
of studies, (b) decisions within a single study, and (c) selected applica- 
tions of methods. 


Design of Studies 
In part because of the increased accessibility of computers, the stream 
of substantive factor analyses typical of earlier periods threatened to 
become a torrent as this flow was augmented by the contributions of many 
whose major purpose seemed to be to “do a factor analysis.” On the other 
hand, the prospects of easier computation than formerly may have con- 
tributed to the increased interest in P- and Q-techniques as well as in the 
other types of models potentially applicable to psychological problems. 


Hypotheses and Interpretations 


. By combining four basic components—stiniuli, responses, occasions, and 
individuals—Coan (1961) concluded that a total of 24 different possi- 


bilities for factor analytic studies was available and suggested ways to 
analyze these possibilities. Bechtoldt (1961) differentiated between ex- 
ploratory and confirmatory analyses and pointed out that common cri- 
teria such as minimum rank and simple structure may be more appropriate 
to confirmatory than to exploratory investigations. Bechtoldt’s data con- 
sisted of scores on 17 tests, each of which was purported to be a univocal 
measure of one of six abilities, administered to more than 400 subjects. 
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He divided the group randomly into two samples, each consisting of more 
than 200 subjects. Factors extracted by centroid. principal axes, and 
multiple group methods showed little resemblance to each other, whether 
or not communalities were used. Differences between samples within 
methods were also marked. However, when guided by prior knowledge of 
the content of the tests, oblique rotations of all sets of extracted factors 
produced a remarkable similarity in the rotated solutions. He concluded 
that a high value of the index of reproducibility of rotated factors was 
to be expected in a well-designed confirmatory study. 

In a later discussion? Bechtoldt (1962) differentiated between the ap- 
propriateness of factor analytic and regression models for testing hy- 
potheses. He contended that the inclusion of criterion variables with 
predictor variables in a factor analysis confuses the relationships. Further- 
more, noting the desirability of linear and experimental independence of 
predictors. Site from the other, he suggested extraction by multiple group 
rather than centroid procedures. He maintained that the former procedure 
avoids the experimental dependence arising in the latter approach because 
all the variables are used in the centroid procedure. It must be pointed 
out that useful results have been obtained when predictors and criteria 
have been used in the same factor analysis and that experimental inde- 
pendence is as much affected by exclusion of variables. as in multiple 
sroup extraction, as it is by their inclusion. 

Cattell (1961) discussed similarity in methods of measurement as a 
Source of factors in addition to the usual sources of covariance arising 
from similarity of traits and from correlation of second-order factors, In 
à study in which hypotheses were derived from physical similarities among 
Variables, Cattell and Sullivan (1962) demonstrated the degree to which 
oblique rotations produced factors closely resembling the hypothesized 
Stoupings. The reader may find cause for concern in the high correlations 
of the “error” factor with one “systematic” factor in each of the samples 
analyzed, 

In a long and thorough discussion of the interpretation of rotated f. 
in oblique solutions, Cattell (1962) presented 6 of the approximat 
Matrices representing possible dimension-variable relationships, He 
mended the use of the primary factor pattern as the basis of interpr 
instead of the reference vector structure matrix more commonly r: 

* emphasized the need to interpret factors as influences on w 
Variables in the study are dependent rather than as mere mathematical 
definitions, For future use, Cattell urged the consideration of a matrix 
indicating the relationship between a, (first-order) factor and av 
When the effects of higher-order factors are partialed from both the 
and the variable. 

It would appear that many varieties of analysis, within the limitations 
of R-technique, are possible; hopefully, one of them will be most ap. 
Propriate for the investigation of the factor analytic psychologist’s 
multivariate problem. 
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P-, Q-, and R-Techniques 


Two papers highly critical of P-technique (correlations between various 
measures taken on a single individual over many different occasions) 
were published. Anderson (1963) questioned the validity of the assumption 
of independence among error or unique scores. He advised distinguishing 
in the analysis between independent (controlled or directly manipulated) 
and dependent (merely observed) variables and between those variables 
subject to measurement error aad those not. F 
of these distinctions for the interpretation of factors and emphasized the 
value of including time and various functions of time in the analysis. As 


an alternative to P-technique, he proposed a type of analysis which appears 
to be a combination of time-series analysis ( 


correlation and gave references to econom 
dealing with similar or related problems, 
P-technique on similar grounds, He too emphasized the lack of independ- 
ence of observations and the possible presence of serial correlations or 
lagged serial correlations as well as the resultantly large number of 
relationships that must be investigated. 

In his review of applications of Q-methodology, Wittenborn (1961) 
differentiated between Q-technique leading to ipsative measures and Q- 
factor analysis based on correlations between persons over items. He 
pointed out that ipsative scales are necessary to meet Stephenson’s criteria 
for Q-methodology in psychology and noted some of the difficulties in 
correlating ipsative measures, Sundland (1962) spoke to a crucial point 
in the correlation of lists of attributes such as those commonly used in 


obtaining Q-sorts, where Synonyms or closely associated labels are in- 
cluded in the same list. The synonyms, considered as replications over 
which the covari i 


Te developed the implications 


autocorrelation) and multiple 
etric and statistical literature 
Holtzman (1962) criticized 


€ computed standard errors of the correlations. 


or of the trait estimates, true? values. This notion 


would appear to warrant further inve 
the reliability of trait score 


comparison with the usual interactive scores, 
factored separately for the thr 
normative-ipsative scores resembled each ot 
the normative or normativ 
the experimentally ipsative scores 
were carried out using R-technique 
interperson correlations based 


In his exploration of the relationships between Q- and R-techniques, 
Broverman (1961) pointed out that the results depend on whether rows or 
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columns are standardized in the raw-score matrix. However, in a later 
article, Broverman (1962) noted the lack of identity of the factors and 
inferred a lack of any relationship—a misstep noted by MacAndrew and 
Forgy (1963) and subsequently clarified by Broverman (1963). Address- 
ing the problem of a suitable correlation coefficient for Q-analysis. Guilford 
(1963) compared several possibilities and recommended a new statistic 
appropriate for use with item responses or other dichotomous raw data. 
The statistic is the Pearson-r for numbers that are standard scores of 
persons relative to item means and item standard deviations. Guilford noted 
that further investigation of the distribution properties of the new statistic 
was needed. 


Other Techniques and Models 


Horst (1961) reviewed the problem of determining the linear combina. 
tions of variables from one set which correlate maximally with variables 
in another set and extended the procedures to more than two sets of 
variables, Several implications for factoring were drawn. Harris (1962) 
Pointed out some relationships between Rao's canonical factor analysis 
and Guttman's image analysis. In a somewhat similar article, Jóreskog 
(1962) advocated the use of residuals as determiners of the rank of the 
matrix and presented procedures for estimating factor scores and factor 
loadings. His residuals appear to be related nonlinearly to Guttman’s anti- 
images (which are defined as unity less squared multiple correlations). 
Guttman (1960) presented five matrices based on the images of the varia- 
bles and concluded that parsimony, in the sense of the least number of 
Parameters needed per observed variable, does not require the reduction 
of the R-matrix to its minimum rank. . 

As a method for using factor analysis to study preferences of individuals 
for objects, Coombs and Kao (1960) discussed the distance between person 
and object as obtained in the method of multidimensional unfolding as a 
Suitable measure for factor analysis. They conjectured the appearance of 
an extra “social utility” dimension corresponding to the distance of each 
Person or object point from the centroid of all such points in the same 
Space. Their conjectures were partially confirmed by two hypothetical 
examples. Cliff and Ross (1962) showed that the conjecture applies 
exactly when squared distances are used. . 

Four papers dealt with Guttman s simplex and circumplex models. Kaiser 
(1962b) not only showed that paired-comparison procedures may be ap- 
plied to correlations between variables to order them on an underlying 
Continuum but also provided an index of goodness of fit of the matrix to 
à simplex. Jones (1962) discussed several methods of analyzing correla- 
tion matrices having simplicial form. A different kind of process was 
implied. in each method. Although the intertrial correlations from learning- 
Curve data are shown to exhibit a sort of simplex, Jones’s procedures 


513 


REVIEW OF EDUCATIONAL RESEARCH Vol. XXXII, No. 5 


contained no guards against Type II errors. Moreover, he seemed to 
mistake the apparent simplicity (skill) of practiced motor performance 
for simplicity in the structure of individual differences between learners. 
Humphreys (1960) presented data showing that correlations between 
scores on the same variable measured at different times in the individual's 
life or at different stages of training have the form of the simplex. How- 
ever, he pointed out that matrices that simply lock like simplexes can 
sometimes be generated by two common factors of the usual kind. Closer 
inspection is necessary in order to decide whether a simplex exists. Meth- 
ods for analyzing a given set of correlations into simplex components are 
included. Gibson (1963) provided an analysis of the circumplex, in which 
the correlations, rather than decreasing steadily from the main diagonal 
as in a simplex, first decrease and then increase toward the off-diagonal 
corners. Guttman proposed a pattern of factor loadings leading to such a 
correlation matrix. Gibson furnished a method for recovering a pattern 
of factor loadings from a correlation matrix. 
indeed a circumplex. 

Gibson (1962a) discussed the problem of assigning new data to pre- 
viously found latent classes. Using a procedure similar to the Dwyer ex- 
tension in factor analysis, he provided a chi-square test for the goodness 
of fit of the appropriate joint proportions. In an earlier paper, Gibson 
(1960a) addressed the possibility that latent classes may be interpreted 
as nonlinear relationships between factors—at least for certain kinds of 
data. 

In a direct attack on the 
(1962) presented methods 


if one may presume that is 


problems of nonlinear factor analysis, McDonald 
for finding out whether a given set of obtained 
factors actually include a smaller number of independent continuums—the 
remaining factors being composites of first- and second-order functions 
of the independent factors. 

Additional References: Battig (1962) ; Bock (1960) ; Du Bois (1902) : 
Foa (1961); Horst (1962); Marks, Michael, and Kaiser (1961a, b): 
Miller. Eyman, and Dingman (1961) : Rust, Jones, and Kaiser (1962). 


Decisions Within a Single Study 


Having decided upon the hypotheses he wishes to investigate, the Lyr? 
factor analyst typically proceeds to collect his data and thus bypasses à" 
important aspect—the development of suitable tests. Unfortunately, n? 
articles dealing primarily with the psychological aspects of tests to Br 
factor analyzed could be found. Presuppositions regarding the factoria 
complexity of tests as well as hypotheses regarding their factcrial validity 
may well influence the investigator both in his choice of communalily 
estimates—a choice which thereby influences the rank of his reduced 
matrix—and in his decisions regarding not only the obliqueness of the 


rotated factors but also the most acceptable simple structure in the rotate 
factor space. 
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Communality and Rank 


It is often surmised that the factoring of different coefficients of correla- 
tion, although based on the same raw data, will result in different values 
of communalities and perhaps in differences in the rank of the reduced 
matrix. The factorial complexity of a test is related both to its communality 
and to the rank of the matrix in which it is analyzed. 

In his discussion of the effects of disparate marginal distributions. 
Carroll (1961) pointed out that wide variations may induce “difficulty” 
factors which may or may not be orthogonal to other factors of greater 
interest. If the investigator is willing to assume that the variables involved 
have a normal distribution in the population from which the samples at 
hand were drawn, then. Carroll concluded. the tetrachoric coefficient is 
least biased. Henrysson (1962) developed the relationship that loadings 
on the first centroid of a matrix of interitem phi (or tetrachoric) coeffi- 
cients are biserial coefficients of correlation of item with factor (or test). 
He suggested the use of factor analysis of items in the development of 
homogeneous tests. 

Now one may question whether person scores on homogeneous tests, 
which are developed by the application of Henrysson’s findings, would 
have a normal distribution. There is scattered evidence that the scores of 
Persons on highly homogeneous tests may have a skewed rather than a 
normal distribution. Skewness appears to be accentuated if the same per- 
Sons’ responses to items were analyzed in order to group items into 
homogencous tests. The next consideration is whether normality would 
exist in the population to permit the use of the tetrachoric coefficient even 
if the sample distributions were markedly skewed, or whether the more 
Conservative assumption resulting in the use of phi coefficients is pref- 
erable, as in the study reported by Merrifield, Guilford, and Gershon 
(1963), ù 

Although the use of unity as a diagonal entry is required if one wishes 
to obtain uncorrelated factor scores, as pointed out by Overall (1962), 
Spurious factors may be engendered by using unity as an initial diagonal 
entry if the true values of communality are greatly less than unity, Ap- 
Parently the interrelations of communality, rank, coefficient of correlation. 
and homogeneity of tests require further study. . 

Gibson (1961) suggested the partitioning of the correlation matrix into 
asymmetric submatrices and derived communality estimates from the inter- 
submatrix correlations and thus extended Tucker’s earlier procedure. How- 
ever, because of the danger that one or more of the submatrices may be 
Singular—especially in small empirical matrices—the procedure should 
not be empl¢yed blindly. 

Comrey (1962) reported his successful use of a new method of ex- 
traction. He described an estimated vector, the inner product of which 
With a row of the correlation matrix is the numerator of a ratio. The 
denominator of this ratio is the sum of squares of factor estimates for all 
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other variables in the matrix. He used, as an initial estimate of factor 
vector values, the highest correlation in the column. The computation re- 
quired for the estimates to converge is more than that needed for centroid 
analysis but less than that required for principal components. If the factor 
vector values are considered as beta weights, the numerator of Comrey s 
ratio becomes the squared multiple correlation. The interesting combina- 
tion of this lower-bound estimate of communality and the long-used 
“highest in column” estimate may have the merits of both estimates. 
Kaiser (1961) provided ar extension of Guttman’s lower bound con- 
cerning the number of common factors in a correlation matrix to the case 
in which communalities may lie in the closed interval from zero to unity. 
In a penetrating article, Maxwell (1961) reviewed the centroid, the 
principal-component, and the maximum-likelihood methods of extracting 
factors and presented comparative data on their application. Tests of sig- 
nificance for the number of factors were reviewed; most of these tests 
incorporate the chi-square statistic. Maxwell questioned the efficacy of 
rotation of factors—primarily on the grounds that no adequate sampling 
theory underlies the notions associated with simple structure. Further, he 
decried not only the lack of concern with such relevant variables as 


maturation, sex, and environment but also the inadequacies of samplings 
of persons. 


Rotational Options 


In contrast to the welcome accorded the deus ex machina of analytic- 
rotational methods via computer several years ago, the current climate 
encourages the growth of procedures for using prior information in reach- 
ing rotational solutions. Fortunately, this utilization, which can be pro- 
gramed, results in a combination of mechanical and informational 
approaches, (These terms do seem more indicative of current thinking 
than do the older terms, analytic and subjective.) The dangers of hy- 


pothesizing after the fact persist—a circumstance which is a natural, not 
a scientific, attribute, 


Lawley (1960) presented a procedure for.computing rotated factor 


loadings directly from the correlation matrix. He used the device of speci- 
fying in an appropriate matrix where the zeroes were to occur. Concerning 
the invariance aspect of rotation, Cattell ard Baggaley (1960) developed 
an index for matching factors in two or more studies. The index consti- 
tutes an application of the hypergeometric (or Fisher’s exact) probability 
function, and its effective use depends in large part on the choice of a 
significance level for factor loadings. To its credit, it avoids the anachro- 
nism of computing Pearson-r coeflicients on loadings intentionally rotated 
to a highly skewed and truncated distribution. As mentioned previously, 
Horst (1961) started from the case of two sets of factors (for example, 
factor matrices for the same variables from two studies) and derived a 


516 


December 1963 Facror AnaLytic METHODOLOGY 


solution which is the same as Hotelling’s “most-predictable criterion.” 
If there are more than two sets, however, the methods must be revised 
somewhat, but remain essentially similar in that they are derived from 
a least-squares criterion. Ahmavaara (1963) took a different approach, 
mainly because he required that each of the transformations relating one 
set of data to another belong to the same group, in the strict mathematical 
sense, 

Cattell and Gorsuch (1963) added random components to an established 
simple-structure matrix and maintained the communalities and relative 
sizes of the factors. Then they applied an arbitrary orthogonal rotation to 
both factor matrices, and finally they rotated each matrix to simple struc- 
ture, They concluded that substantially better simple structure obtained for 
the “real” than for the “randomized” data. Cattell and Dickman (1962) 
used artificial data to investigate the results of orthogonal and oblique 
rotations of factors derived from a number of measurements on a number 
of balls. Factor analysis of the data yielded the expected factors. Oblique 
rotation resulted in what would be judged to be a good simple-structure 
matrix in which the number of zero loadings was significant by Bargman’s 
test. Although neither qualitatively satisfactory nor statistically significant 
in terms of hyperplane counts because of the presence of many inter. 
mediate.sized loadings, the orthogonal solution (varimax) resulted in 
factors yielding the same interpretations as those in the oblique solution. 

In two papers, Gibson (1960b, 1962b) addressed the problem. of ob. 
laining orthogonal from oblique transformations. In his first contribution, 

€ gave a geometric comparison of orthogonal and oblique factors and of 
their interrelationship; in the second, he showed how to find an orthogonal 
transformation having a least-squares fit to a given oblique transformation. 
Presumably, the orthogonal matrix would be applied to the unrotated factor 
Matrix to obtain an orthogonal rotated solution. Gocka (1962) presented 
lé results of an empirical Hare on the optimal weights for 
ur: y ic rotation method. 

In K paa ar criteria, Saunders (1961) extended ,Quartimax 
Notions to the oblique case and called the resulting procedure oblimax.’ 
n their development of a program to maximize the number of zero entries 
in each hyperplane, Cattell and Muerle (1960) noted that the oblique 
factors meeting their “maxplane criterion required visual improvement 
to meet all psychological criteria. Eyman, Dingman, and RE sm (1962) 
Concluded that biquartimin produced more meaningful results than oblimax 
for the oblique case and that varimax produced more readily interpretable 
results than quartimax for the orthogonal case. Although they could not 
Obtain a satisfactory orthogonal solution for their data by the Zimmerman 
Eraphic procedure, they did achieve a satisfactory Thurstone oblique 
Solution for comparison with the mechanical procedures, Their criteria 
were simple structure and psychological meaning. Marks, Michael, and 
Kaiser (1960) applied the varimax criterion to a matrix of aptitude 
variables for which a satisfactory informational solution had been achieved. 
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As a result, the number of factors having significant loadings was greatly 
reduced, and the factors themselves represented composites of those ob- 
tained in the informational solution. Merrifield, Guilford, and Gershon 
(1963) reported a similar finding with less distortion than in the pre- 
viously mentioned study. The varimax procedure, which may have been 
required to rotate too many factors, did result in larger groups of tests 
and in more factors having but one significant loading than did the 
graphical solution. In the use of varimax, it seems important that "pe 
factor be represented by three or more tests and that the number o 
centroid factors or principal components to Fe rotated be specified 
correctly. : 

In their extensive discussion of the rotational problems arising from 
sampling of variables, Guilford and Zimmerman (1963) concluded that 
the investigator has a responsibility to use information regarding the pre- 
vious performance of the measures in the battery, the factorial complexity 
where it is known or suspected in a test, and the degree to which a factor 
is represented in a battery. Informational solutions are preferred to 
mechanical; and orthogonal, to oblique. 

Additional References: Burroughs and Miller (1961): Gibson (1961); 
Howard and Cartwright (1962); Howard and Gordon (1963); Jarrett 
(1961); Lumsden (1961); McQuitty (1961a, b) ; Saunders (1962). Be- 
cause of limitations of space, computer programs for factor analysis can- 
not be discussed in detail. Selected examples may be found in Dixon 
(1962a, b) ; Hurley and Cattell (1962) : Saunders (1960); Vandenberg 
and Lamphiear (1960) : and Wexler (1960). 


Selected Applications of Methods 


The citations in this section w 
Kaiser (1962a) indicated |} 
principal components: rot 


ere selected for methodological interest. 
how component scores might be obtained from 
ated factors were not considered in his discus- 
sion. Kaiser and Dickman (1962) developed score matrices and mage 
tion matrices by sampling from a known population of factor scores and 
correlations; this type of effort should be helpful in studying the effects 
of sampling on factorial invariance, à 

Dingman and McIntyre (1962) showed that the loadings of raters on 
factors derived from interrater correlations could be used to indicate when 
different raters were using different criteria and illustrated how this 
information might be applied to improve selection of employees. . 

In comparing factor analytic procedures with profile analysis, Nunnally 
(1962) demonstrated that the factor analysis of raw-score distance matrices 
would lead to useful profile information. He noted that factor analysis as 
a technique is not restricted to the analysis of correlation matrices but 
that it applies to any Gramian matrix, Nunnally also recommended the 
application of factor analvsis for obtaining clusters of homogeneous pro 
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files. Horn (1961) suggested the use of tests developed in the context of 
discriminant-function analysis to ascertain the significance of various 
profile statistics. Cliff (1962) and Weitzman (1963) went even further 
afield. Cliff showed that factor analysis applied to matrices of raw data 
may be used in testing the appropriateness of a variety of explicit mathe- 
matical theories. Weitzman presented methods for analyzing trials-by- 
subjects learning data into idealized learning curves. One advantage of 
this approach is that the total learning curves, rather than single charac- 
teristics of them, can be compared for different treatment groups. He 
provided procedures for estimating different components of the learning 
curves as well as statistical tests for comparing different learning curves. 

Additional References: Haertzen (1963); Harris (1962) ; Sawrey, 
Keller, and Conger (1960). 


Summary 


Contributions to factor analytic methodology since 1960 were reviewed 
under three headings: design of studies, decisions within a single study, 
and selected applications. There was increased attention to the specifica. 
tion of the purposes of factor analysis and to the rules for the interpre- 
tation of obtained factors. Several investigators explored the persistent 
problem areas involving communality, coefficients of correlation, and rank. 
If the present trend continues, new rotational methods should evolve as 
combinations of mechanical and informational approaches—methods that 
not only add computer capabilities to those of the factor analyst but also 
stimulate him to develop inferential procedures consistent with both 
sampling variability and existing knowledge. 
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CHAPTER VI 


Design of Sample Surveys in Education 


FRANCIS G. CORNELL and EUGENE P. MC LOONE 
D 


A urnovcn applications of modern sampling methods are growing in 
education, fields other than education remain in the forefront both in 
methodology and in applications. Few sample surveys in education ex- 
hibit the sophistication found in studies carried out in other fields. Appli- 
cation of sample surveys to educational situations and areas has been 
largely by persons from disciplines other than education. Consequently, 
much of the technical literature on the subject deals with matter outside 
the field of education. 


General References 


The importance of sample surveys for educational research and the 
developments in this field since the detailed presentation by Sitgreaves and 
Solomon in the December 1957 Review justify a separate chapter in this 
Issue. This chapter supplements a section of the one by Michael and 
Hunka in the December 1960 Review. In the Encyclopedia of Educational 
Research, Cornell (1960) described the major methods of sampling. 


Nontechnical Sources 


Sample survey methods apply to the problem of enumeration—in other 
Words, to the determination of how many or how much. There are now 
excellent sources of relatively nontechnical information on sampling theory 
and methods, which should be useful to researchers in education who are 
Dot themselves mathematical statisticians. Necessary steps are clearly 


defined by Slonim (1960) as follows: 
l. Determine, as precisely, as possible, the population or universe to be 


Studied. 
2. Set up a sample frame, that is, a list or file of the units in the 


Statistical universe. 

3. Designate the information desired from the survey. 

4. Determine the desired degree of precision in terms of permissible 
error (tolerance) and acceptable risk (confidence). 


Economics of School Finance, Office of Education, U.S. Depart- 
The interpretations and opinions expressed are those of the 
Office of Education or tbe Department of Health, 
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5. On the basis of the desired degree of precision, select the appropriate 
sample design. 

6. Design the reporting form. 

7. Conduct a small-scale pretest of the reporting instrument. 

3. The report form may be reconstructed on the basis of pretest results. 

9. Undertake the survey as planned. 


While essential to a sample survey, these steps do not have to be fol- 
lowed precisely in this order. Obviously, there can be overlapping, and one 
step may precede another. ' i 

The Phi Delta Kappa Symposium on Research Design and Analysis 
outlined sample design methods and explored difficulties in sample surveys 
in education. The chapter by Kish (1961) is an excellent introduction to 
sample design theory. The three difficulties of the reaction of the public, 
the training of researchers, and the technology in education were cited in 
a chapter by Cornell (1961). Acceptance of sample surveys by the public 


and by research workers in education should develop from the popular 
treatment of sampling by Slonim. 


Development of Statistical Theory 


In a general sense, sample surveys are a special application of the central 


problem of theory and methods of statistical inference, namely, regulating 
the degree of uncertainty. The idea of randomization, originally introduced 
by Fisher, by which the statistician introduces an element of chance into 


the selection of observations and thus creates a random experiment is 


shown by Dalenius (1962) to be a link between several branches of design 
for collecting and analyzing such observations as experiment, acceptance 
sampling, Monte Carlo sampling, and sample survey. In reviewing develop- 
theory and methods, Dalenius pointed to 
rior to that time. Though efforts continue 
& variance models, increasing attention is 
and to methods that take into account non- 
ome to be called mixed error models. This 
ors is a development of great importance in 
more often than not a mixed error model is 


being given to survey theory 
sampling errors, what have c 
attention to nonsampling err 
educational Surveys, since 
needed. 


statistician should begi 
Hansen, Hurwitz, and Mad 


a mathematical statistician experienced both in the theory and in the 
practice of sample survey methods. 
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Among the books that have appeared in the period covered by this 
REview is a revision of Cochran’s (1963) excellent text and a new book 
on sample design in business research by Deming (1960). The applica- 
tions of the procedures described by Deming to the managing and operating 
of schools are legion. 


Sample Surveys in, Education 


The use of sample surveys in educational research is growing. Mathe- 
matical statisticians have been employed by the U.S. Office of Education 
for assistance in sample design since 1960. 

In conjunction and in cooperation with various groups, the National 
Science Foundation has conducted a number of studies of interest to edu. 
cators. With the Bureau of Social Science Research, it carried out a study 
of the 1958 college graduating class and a follow-up investigation of the 
1958 graduates enrolled in graduate school. In association with the Na- 
tional Opinion Research Center (NORC) under a cooperative research 
Project in the U.S. Office of Education, it completed a study of career 
choices and decisions of 1961 graduates (Davis, 1963). Presently with 
NORC, the Foundation is studying the cooperation of graduate education 

Y various fields. In cooperation with the National Science Teachers As. 
Sociation (NSTA), the National Science Foundation (1963) completed 
Studies of the salaries, education, and class loads of science and mathe- 
matics teachers in the secondary schools. The employment of mathemati- 
Clans in industry and their education and training were also subjects 
Studied by the National Science Foundation (1962). A sample of 70,000 
Persons, who were selected by occupation from the 1960 Census of Popu- 
lation, is being conducted by the Foundation to examine the training and 


education of in various occupations. 
persons in 
As described in a recent report by Anderson and Hoofnagle (1960), 


the U.S, Department of Agriculture is continuing its study of school food 
Services on a sample basis. Similarly, the U.S. Department of Commerce, 
Bureau of the Census (1958, 1959, 1960a,b, 1962a,b,c, 1963a,b,c) con- 
tinued its studies of school enrollment, school employment, school finance, 
and illi 
ive a in which sample survey methods were used were issued by 
the U.S, Office of Education. Approximately 25 additional studies, in which 
Various sample designs are being employed, are in progress. Sample sur. 
veys published by the Office dealt with teacher turnover, the status of 
eginning teachers, programs in the schools for home economics, the 
°rganization of elementary schools in large cities, and the status of adult 
education, -— 3 
. The appendixes of many of the governmental publications cited above 
Indicaie sampling methods used and problems encountered. A wealth of 
illustrative material is available from these sources. 
EOS 
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Use of Available Lists in Sampling Procedures 


Lists of school districts from the Bureau of the Census prepared for the 
1952 and 1957 Census of Governments have been the principal source for 
sampling of school districts. In U.S. Office of Education studies, red 
and Bain (1959) depended upon the list of school districts from the 1952 
Census of Governments: Mason (1961) used the 1957 Census of Govern- 
ments listing. Coon (1962) for the U.S. Office of Education. Flanagan and 
others (1962) for Project Talent, and the National Science Foundation 
(1963) utilized the basic information in the Directory of Public Secondary 
Day Schools 1958-59 by Ramsey (1961). In his study of large city ele- 
mentary schools, Dean (1960) relied upon the 1950 Census of Population 
to locate cities to be studied, 

It was not until 1959, when the U.S. Office of Education began a com- 
prehensive study of school district expenditures, that a list of school sys? 
tems was specially obtained for an Office sample study. The results of this 
listing, which are presented in a report of the U.S. Department of Com- 
merce, Bureau of the Census (1960b) ; show a reduction in school systems 
from 52,913 in 1957 to 12,129 in 1959, In 1952, there were 69,755 school 
systems in the United States; and in 1962, only 37,438. The Woodward 
(1961) survey of adult education statistic 
tion that can be obtained by sampling 
regular reporting system. 

The National Science Foundation (1963) employed the Register for 
Junior and Senior High School Science and Mathematics Teaching Per- 
sonnel 1960-61, compiled by NSTA. NSTA used the basic data in the 
Office directory for both private and secondary schools, As the report by 
the Foundation indicated. cluster sampling error and additional complica- 


tions in the collection of data would have been introduced without use of 
such a list. 


The multistage sam 
scribed by Anderson 


$ is a good example of informa- 
but is not readily derived from a 


pling of the U.S. Department of Agriculture, de- 
and Hoofnagle (1960), for data on school food 
services began with the Bureau of the Census list of school districts. Lists 
of school plants were obtained by the Bureau of the Census from a sample 
of school districts. After a determination of which school plants conducted 
a lunch program, schools were selected on an area-probability basis. In- 
voices from these schools for two months were obtained. The sample was 
subdivided into six approximately equa] subgroups of schools with in- 
voices of deliveries taken in a rotating, pattern in orde: to provide coverage 
during all months of the year. Because of sample size and of the relatively 
uniform nature of the National School Lunch Program, there was a smaller 
sampling error for schools participating in the national program than 
there was for nonparticipating schools. 

When the first step in a sample survey entails the establishment of a 
universe listing of school systems or secondary schools or other units to 
be sampled, many of the advantages of sample surveying are lost. Estab- 
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lishing and maintaining a universe list is of major importance in con- 
ducting sample surveys in education. When old universe listings must be 
used, the nonsampling error is increased because of out-of-scope systems 
and other problems. Reports of most of the sample surveys previously 
cited have indicated such difficulties. Reorganization of school systems as 
evidenced by the previously indicated reduction in numbers of school 
districts and the continuing establishment of new schools during the past 
decade emphasize the need to identify and to maintain school universes. 


Sampling as a Means of Identifying 
Information in a Census 


Sampling plays an important role in the identification of information 
in a census. Two publications from the 1957 Census of Governments de- 
serve mention, In a U.S. Department of Commerce, Bureau of the Census 
(1958) publication. detailed classes of financial information were obtained 
on a sampling basis. This action was motivated by the desire to obtain a 
maximum amount of information within a limited budget. It is far easier 
to control nonresponse errors for a small group of school districts than 
for the entire universe. The U.S. Department of Commerce, Bureau of the 
Census (1959) provided estimates of the market value of property in the 
United States by state. It would have been impossible to have obtained this 
information in almost any other manner. In its investigation, the U.S. 
Office of Education used a similar procedure in attaining almost complete 
universe coverage. In several studies, extremely small schools were sampled 


in the early 1950's. 


Teacher Surveys 


During the past several years, the National Education Association, Re- 
Search Division (1960), along with the U.S. Office of Education, has been 
the primary source of information about education on the national level. 
Its efforts to use the power of sampling theory da sending 1,000 question- 
haires to a sample of the universe of 1.3 million teachers in the United 


States was evidence of advance in this area. In an interesting paper. 
s wa 


Robinson and McCall (1961) discussed Fésponse Tales sud vatis sape 
Mates for teacher surveys conducted by the National Education Association. 

heir position supported what has been advocated for a decade or two 
on such surveys: (a) sampling to achieve ated m avoiding non. 
response, and (c) controlling uncertainty which can be related to random 


esign. 


Project Talent 


nstitute for Research, Project Talent is an 


Cond ; the American I 
ucted by th aracteristics of schools and their effects 


example of a large-scale study of ch 
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upon students—a study which exemplifies the necessity of eco Siig 
sample theory and technique in education. At one and the same mS 
requires what the statistician calls a multiparametric model, requir 2 
a model for subpopulation studies, and represents a mixed model. Initiate 


of uncertainty was difficult. In thé investigation, 440,000 students in grades 
ere tested with instruments 
; achievement, attitudes, interests, 


log i ; and personal plans. A stratified, 
random cluster sample (by school, by state, 


holding power) was used primarily to obtain a "representative? sample 
with a degree of efficiency. Because of the large number of students in the 
study, it should yield much valuable information and be useful in blue- 
printing some of the variances with which the Statistician must deal in 
ndary school populations on similar dimensions. 


Furno's Study 


An additional study which has involved sampling is significant because 
it is one of the few investigations which is concerned with some of the 
day-to-day budgeting, finance, and operational 
Furno (1962) undertook a statistical 


ex, a school system can compare cost data with 
h the average of its class of district. Since 


Limitations of Educational Applications 


available with existing variance models 
3. Inappropriate variance formula in computing confi 
598 


dence intervals. 
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A common fault is the use of a cluster sample and a simple random error 
variance. The study reported by Robinson and McCall (1961) showed 
the standard error for mean age of teachers as computed by the unre- 
stricted random method to be .32 and by an appropriate cluster formula 
to be 1.85—a considerable difference. 

4. A “good” probability sample, but unsuccessful coverage and lack of 
treatment of nonresponse and other biases 

5. Sampling on the basis of an acceptable probability model, but not 
computing or reporting observed precision 

6. Use of sophisticated sample design for which an error-loaded ques- 
tionnaire or test on ambiguous or multidimensional attributes introduces 
psychometric errors of unknown but undoubtedly large magnitude 

7. Attempts to reproduce a population or subpopulation of schools, 
teachers, or pupils in many dimensions accompanied by the introduction 
of impractical multiparametric design problems. 


Bibliography 


Wiutam S. The Market for Food in Public 
Agricultural Marketing Service, Marketing 
C.: Government Printing Office, January 


ANDERSON, KennetH E., and HoorFNAGLE, 
Schools. U.S. Department of Agriculture, 
Researoh Report No. 377. Washington, D. 

. 58 pp. " " 7 

Cocunaw, Ten G. Sampling Techniques. Revised edition. New York: John Wiley 
& Sons, 1963. 413 pp. ; 
oon, B : Economics in the Public Secondary Schools. U.S. Department 
of Hehi Eaei, and Welfare, Ote of Bauran Circular No. 661. Wash- 
i 'C.: Government Printing Office, 1962. pp: ; 
aa poen o ado Methods." Encyclopedia of Educational Research. 
(Edited by Chester W. Harris.) Third edition. New York: Macmillan Co., 1960, 


Dp. 1181-84. 


OnNELL, Francis G. "The Sampling Problem in Educational Research." Research 


Dest nda d Annual Phi Delta Kappa Symposium on Educational 
Voie tition he tani 0. Collier, Jr., and Stanley M. Elam.) Bloomington, 
Ind.: Phi Delta Kappa (Eighth St. and Union Ave.), 1961. Chapter 3, pp. 65-95. 
ALENIUs, Tore. "Recent Advances in Sample Survey Theory and Methods.” Annals 


P istics 33: 325-49; June 1962. 
bg De atea, Siati s y: climes. Career Decisions and Educational Plans 
Deed 4 is Department of Health, Education, and Welfare, Office of 
oe do oin Research Project No. 1194. Chicago: National Opinion 
esearch Center, University of. Chicago, 1065,0 Fi Organization. U.S. D 
EAN, Stuart E. Elementary School Administration ane -T8 Bulletin 1960. Neat. 
ment of Health, Education, and Welfare, ome Cw pA ulletin 1960, No. 11. 
7 h P t Printing ce, s y " 
& Sons, 1960. 517 pP. ; ican High S j 
: ] l . Stud: the American High School. Project Talent 
PLANAcan, Jony C., k i 5 arua oy qz Department of Health, Education, and 
Weer ph Series, D ation, Cooperative Research Project No. 226. Pittsburgh, Pa.: 
: 2 á 62. 368 pp. " 
rutahrersiy of Pittsburgi, Lot Loole at the Cost of Education Index.” School 
Management 6: 171-72, 174-78; January 1962. 


7 ESO arch Design and Analysis: Second 
Isu, Leste, “The Design of Sample Surveys. beset T Research. (Edited b y Raymond 


A ji i Ed > 
Orga Phi Delta Kaphi YM Elam.) Bloomington, Ind.: Phi Delta Kappa (Eighth 
St. and Union Ave.), 1961. Chapter 2, pp. 45-63. 


529 


kea ; ES 
Review or EDUCATIONAL RESEARCH ] ol. XA MIT, No 


. The Beginning Teacher: Status and Career Orientations. U.S. 
accord ed Education, and Welfare, Office of Education, Circular No. 614. 
Washington, D.C.: Government Printing Office, 1961. 196 pp. ie Schools 
MASON, WARD S., and Bain, ROBERT K. Teacher Turnover in the FE ehoa 
1957-58. U.S. Department of Health, Education, and Welfare, Office of ucation, 
Circular No. 608. Washington, D.C.: Government Printing Office, 1959. 38 pp. at 
NATIONAL EDUCATION Association, Researcu Division. “Small-Sample Techniques. 
NEA Research Bulletin 38: ead December Eus 
NartoNAL SciENCE FouxpaTIoN, Employment in 
Industry and. Government. 
82 pp. 
NATIONAL SCIENCE 
Characteristics ai 
1963. 45 pp. . i 
Ramsey, Lean W, Directory of Public Secondary Day Schools 1958-59. Publication 
No. OE-20031. Washington, D.C.: Government Printing Office, 1961. 163 PP- 
Rosinson, GLEN, and MCCALL, CHESTER H., Jn. Some Statistical Findings from 
Nationwide Teacher Polling. Paper presented at the annual meeting of the American 
Piel, Aasociation, New Mon City. Washington, D.C.: Research Division, 
ational Education ssociation, 1961. 15 pp. 
tonin; Momus James. Sampling in a Nutshell. New York: Simon and Schuster, 1960. 
145 pp. " 
U.S. DEPARTMENT oF Commence, Bureau or THE CENSUS. Finances of School Districts. 
an Census of Governments, Vol. 3, No. 1. Washington, D.C.: Government Printing 
ce, 1958. pp. 


U.S. DEPARTMENT OF Commence, Bureau or tHe Cen 
in the United States. 1957 Census of Government 
Government Printing Office, 1959. 145 pp. 

U.S. Department or Commence, Bureau or THE C 
Survey: 1950. Technical Paper No. 4. Washington, D.C.: the Bureau, 1960. 99 pp. (a) 

U.S. Department oF Commerce, BUREAU or THE Census. Public School Systems 
in 1960. State and Local Government Special Studies, G-SS, No. 44. Washington. 
D.C.: the Bureau, 1960. 31 pp. (b) 


U.S. DEPARTMENT OF Commerce, Bureau OF THE Census. School Enrollment and 
Educational Attainment, for the United States: 1960. Current Population Reports 
Supplementary Reports, Series PC(S1)-20, Washington, D.C.: the Bureau, 1962. 
6 pp. (a) 

U.S. po OF Commence, BUREAU OF THE Census. School Enrollment: October 
1961. Current Population Reports—Population Characteristics, Series P-20, No. 117. 
Washington, D.C.: the Bureau, 1962, 11 pp. ( 


b) 
U.S. DEPARTMENT OF COMMERCE, BUREAU OF THE 


Finances in 1961, Governmental Finances 
G-GF61-No.1. Washington, D.C.: the Bureau, 1962, 25 pp. (c) 5 
U.S. DEPARTMENT OF Commerce, BUREAU OF THE Census. City Employment in 1902. 
me Employment: 1962, Series G-GE62-No. 2. Washington, D.C.: the Bureau. 

1 pp. (a) 


U.S. DEPARTMENT or Commence, Bureau or THE Census. Estimates of Illiteracy, by 
States: 1960. Current Population Reports, Series P-23, No. 8. Washington, D.C.: 
the Bureau, 1963, 2 pp. (b) ? 

U.S, DEPARTMENT OF Commence, Bureau or THE Census. State Distribution of Public 
Employment in 1962. Government Employment: 1962, Series G-GEG2-No.l. Wash- 
ington, D.C.: the Bureau, 1963, 23 pp. (c) ^ 

Woopwanp, MARTHINE V. Statistics of Public School Adult Education 1958-59. U.S. 
Department of Health, Education, and Welfare, Office of Education, Circular 
No. 660. Washington, D.C.: Government Printing Office, 1961. 49 pp. 


rofessional Mathematical Pork in 
Washington, D.C.: Government Printing Office, 1962. 


FOUNDATION. Secondary School Science and. Mathematics Teachers: 
nd Service Loads. Washington, D.C.: Government Printing Office, 


Us. Taxable Property Values 
s, Vol 5. Washington, D.C.: 


US. The Post-Enumeration 


Census. Summary of Governmental 
in the United States: 1961, Series 


Additional References 


Anperson, T. W. “A Modification of the Se 
Sample Size.” Annals of Mathematical S. 


530 


quential Probability Ratio To Reduce the 
tetistics 31: 165-97; March 1960. 


December 1903 DESIGN OF SAMPLE SURVEYS IN EDUCATION 


BECHHOFER, Rosert. “A Note on the Limiting Relative Efficiency of the Wald 
Sequential Probability Ratio Test.” Journal of the American Statistical Association 

.. 959: 660-63; December 1960. 

CoatE, Anstey Jọ and STEPHAN, Frepentck F. "The Case of the Indians and the 
Teen-Age Widows.” Journal of the American Statistical Association 57: 338-47: 
June 1962. 

Davin, Marti 
Received W 

_ Association : 

Fax, Ciexe-Ten; Muter, Mervis E; and Rezvcia, Ivax. "Development of Sam 


“The Validity of Income Reported by a Sample of Families Who 
tance During 1959." Journal of the American Statistical 


1, Ivan P. “Sampling with Varying Probabilities Without Replacement: Ro- 

tating and Non-Rotating Samples.” Journal of the American Statistical Association 
, 98: 183-201; March 1963. " s 

Feros, R. L; Kramer, C. Y.; and Cruxies-Ross, C. W. “Joint Estimation of the 
Parameters of Two Normal Populations.” Journal of the American Statistical 
_ Association 57: 446-54; June 1962. "n 

Fieninc, Myron B. "On the Use of Correlation To Augment Data." Journal oj the 
American Statistical Association 57: 20-32; March 1962. . 

Grasser, GEmALD J. "A Distribution-Free Test of Independence with a Sample oi 
Paired Observations." Journal of the American Statistical Association 57: 116-33; 


March 1962. (a) à 
G "| Gand J. “Variance Formulas for the Mean Difference and Coefficient of 
icentration." Journal of the American Statistical Association 57: 648-54; Sep- 
„tember 1962. (b) 
70ODMAN, LEO A, “Statistical Methods 
American Statistical Association 56: 841-68; December 1961. 
lanver, H. Leon. “The Use of Sample Ranges in Setting Exact Confidence Bounds 
for the Standard Deviation of a Rectangular Population. Journal of the American 
Statistical Association 56: 601-609; September 1961. iliti ji 
mice n T MAINE "Sampling with Unequal Probabilities and With- 
out Replacement.” Annals of Mathematical Statistics 33: 350-74; June 1962. 
lorrroinc, Wassity. “Lower Bounds lor the Expected Sample Size and the A yerage 
isk of a Sequential Procedure.” Annals of Mathematical Statistics 31: 352-68; 


June 1960 : 
AEGER, Cano, M., and Pennock, Jean L. "An Analysis of Consistency of Response 
in Household Surveys.” Journal of the American Statistical Association 56: 320-27; 
June 19 F i 

SINDATIL y^ Means and Totals from Finite Populations of 
J m rican Statistical Association 57: 61-91; March 


for the Mover-Stayer Model." Journal of the 


s K. "Estimation of 


known » Journal of the Ame 
Kisi, Lesne. “Efficient Allocation of a Multipurpose Sample.” Econometrica 29: 363- 
85; July 1961 : r 
Kisu Di bd sd 1, N. Krisunan; and PILLAI R. Krisuna, “The Ratio Bias 
in Kane d peer "American Statistical Association 57: 863-76; December 


1962, "m Curses 

Koni Henprix S. “Regression Analysis in ae Oe Journal of the 
Athoriene Que ct Association 51: 590-606; September 2 i 

sone EE Statistical A T.s and Torr, C. W ^m he Use ol Sample Quasi-Ranges 
in Setting ‘Confidence intervals for the Popula ion Standave, Deviation: Journal 
of the American Statistical Association 56: 260-12; June, le s 

Mapansky ALpert, “Tests of Homogeneity lor Corel SRED es." Journal of the 
American Sorar ett f iation 58: 97-119; March 3. . 

Mona an Statistical dee ANQUIST, Joun A. "Problems in the Analysis of Survey 

ps S AM D rongiaL Journal of the American Statistical Association 58: 415-34; 


June aito f : 
\ lo. MN d Serm, V. K. “Randomized Rounded-Off Multipliers in Sampling 
Theory.” Joulnal of ‘the American Statistical fsueiuon su Ese dune D. 

7 : " 4 i i Stra! [ Jb; 
'ETO DE PASCUAL, JOSE. “ Unbiased Ratio Estimators SH TL pling urna 


i talist iation 56: 70-87; ; , 
n of the American Statistical Aso sn Journal of the American Statistical 


^J, Des. “On Matching 
Association 56: 151-55; March 1961. 


531 


Review or EDUCATIONAL RESEARCH Vol. XXXIII, No. 5 


PEETER: = 

, and VıtHayasar, C. “Unbiased Componentwise Ratio Estimation. 

o FA spas Statistical Association 56: 350-58; June 1961. j the 

Ross, Aran. “Variance Estimates in ‘Optimum’ Sample Designs.” Journal of 
‘American Statistical Association 56: 135-42; March 1961. . lcu dem 

SawpiFonp, PETER J. “A New Binomial Approximation for Use im Samne I P 
Finite Populations." Journal of the American Statistical Association 55: 718-22; 

ember 1960. , — ati 

i Ester B., and Dwyer, PAuL S. "Computation with Multiple k-Statistics. 
Journal of the American Statistical Association 58: 120-51; March 1963. " ure 

SieceL, SinNEv, and Tukey, Joun W. * Nonparametric Sum of Ranks Kreeedunp 
for Relative Spread in Un * Journal of the American Statistical 
Association 55: 429.45; Septemb>r 1960, 

SuKHATME, Bakr j 
Journal of the 2; September 1962. 

Weiss, LioNEL. “Tests of Fi servations Falling in the 
Shortest Sample Spacings Determined by Earlier Observations.” Annals of Mathe- 
matical Statistics 32: 838-45; September 1961, 

Weiss, Lion Minimize the Maximum Expected Sample 
c3 Journal of the American Statistical Association. 57: 551-66; September 
962. (a) . 

Weiss, LIONEL, “A Sequential Test of the Equality of Probabilities in a Multinomial 
Jg UU Journal of the American Statistical Association 57: 769-74; December 
1962. (b) ; 

Wittiams, W. H. “Generating Unbiased Ratio and Regression Estimators.” Bio- 
metrics 17: 267-74; June 1961. 

WirLiAMS, W. H. “On Two Methi 
Journal of the American Statistical Association 57: 184-86; March 1962. (a), 

Wittiams, W. H. “The Variance of an i i ost-Stratified Weighting. 

Journal of the A sociation 57: 622-27; September 1962. (b) 


532 


A 


b 


CHAPTER VII 


Scaling and Measurement Theory 


HAROLD SCHIFFMAN and SAMUEL MESSICK * 


Derne THE past few years, quantitative psychology has developed from 
its earlier concerns with psychophysics and mental testing into what 
amounts to the general application of mathematics to the solution of 
Psychological problems (Gulliksen, 1959). Several new areas have come 
under the purview of mathematical psychology—for example, statistical 
inference theories of signal detection, scaling models incorporating mathe- 
matical descriptions of individual differences, and computer simulations 
of cognitive processes. 
: Disparate fields of psychology have been coalesced through the matura- 
Hon of mathematical theories, with a strong trend toward axiomatics 
appearing among the theoreticians in the field (Luce, Bush, and Galanter, 
1963a, b). These theories extract the invariances of structure and form 
Tom domains which apparently differ on the surface. Thus, for example, 
9ne can see concepts of information theory permeating both theoretical 
and empirical considerations within psychophysics and scaling. One finds 
game theory being incorporated into signal detection models, and both 
of these approaches have their impact on mathematical learning theory. 
hese advances in theory, however, have not usually been matched by 
equally extensive empirical research, in either experimentation or valida- 
tion, The theoretical aspects of psychometrics thus appear to have out- 
Istanced by far the verification of empirical consequences of these 
theories, Considerable theoretical emphasis, for example, has been placed 
9n the metric properties of the numbers generated by a theory—an 
emphasis making explicit the level of measurement (ordinal, interval, or 
Tatio) consistent with the model. However, this survey of research indi- 
Cates that neither the numbers themselves nor the transformations allow- 
- on them are put to much Tw er PI 
"inally, i nized that, unlike physics in which the measur- 
ing pu Hume js pee cases of more general laws, measurement 
tec niques in psychology usually have coordinate status. They frequently 
imply Psychological theories of the same order as the hypotheses they are 
ms to evaluate. Thcrefore, when these measurement techniques are 
Ihterposed between a hypothesis and the raw data, their filtering action 
May only aliow information conformable with the axioms of the measur- 
1 device to become visible to the experimenter (Messick and Ross, 
2). 
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History of Psychometrics 


Psychometrics generally marks its beginning with the pu pi 
Fechners Elemente der Psychophysik, and Psychometrika devote s 
commemorative issue jointly to the one-hundredth anniversary of this 
event and to the twenty-fifth anniversary of the founding of the Psycho- 
metric Society. In this commemorative volume, a ‘series of papers on 
various aspects of the history of psychometrics appeared. Dunlap (1961) 
critically evaluated the status of quantitative psychology and argued with 
great force that present-day psychometrics is "extremely narrow in its 
concept of what constitutes the development of quantitative rationale for 
the solution of psychological problems” (p. 69) and that it has “become 
complacent, perhaps authoritarian, and unimaginative in . .. approach 

27 (p. 70). Boring (1961) commented upon some contributions 
of Fechner as the “inadvertent founder of psychophysics.” Guilford 
(1961) discussed trends in psychometric publications and reviewed some 
active issues in the field. Gulliksen (1961) provided a substantive history 
of scaling and discussed the laws of comparative and categorical judgment. 
multidimensional scaling, and other recent developments in measurement 
in the context of establishing scientific laws of human behavior. He hoped, 
in conclusion, that these methods would not be considered ends in them- 
selves, but would be used to investigate the properties of judgments and 
their functional relations with other variables, 

An informative historical introduction to 
in a scholarly article by Ramul (1960). From his concrete illustrations of 
eighteenth century scientists trying to come to grips with metrical meth- 
ods for dealing with human beings, one can appreciate the difficulties 


encountered when the concepts and categories of thought are not suffi- 
ciently definite for such an approach. 


s sned 
psychometrics also appeared 


Psychophysics 


It is well documented that measures of sensory magnitude obtained by 
category rating and direct magnitude estimation methods (Stevens, 1958. 
1960) produce two different scales that are nonlinearly related to each 
other (Stevens and Galanter, 1957). Galanter and Messick (1961), in- 
vestigating this problem experimentally, found that a processed category 
(successive intervals) scale was a logarithmic function of the magnitude 
scale. Rationales for the logarithmic relation have been advanced by Eisler 
(1963), by Helm, Messick, and Tucker (1961), and by others. 

The anniversary issue of Psychometrika also included a discussion of 
recent developments in psychophysics, Keith Smith (1961) discussed 
certain issues in the presentation of stimulus material to the subject. 
which he calls “stimulus programming.” Swets (196la) reviewed the 
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theory of signal detectability and discussed its implications for psycho- 
physics. Stevens (1961) continued his long-standing controversy with the 
Fechner-Thurstone approach and concluded that ratio scales arrived at 
by direct methods “have greater demonstrable usefulness . . . than any 
Fechnerian scale yet devised” (p. 16). 

However, in a thoughtful article on some of Stevens’ assumptions in 
psychophysics. Warren and Warren (1963) considered the problem of 
whether quantitative responses about stimuli do or do not directly reflect 
the subject’s mates of stimulus intensity. They offered, as an alterna- 
tive to Stevens’ view, a physical correlate theory in which psychophysical 
judgments are considered to be estimates of physical magnitudes asso- 
ciated with varying degrees of stimulation. Loudness judgments, for 
example, would be based upon experience with distance judgments in 
spatial localization (Warren. 1963). In support of this theory, they found 
that judgments of half loudness and half brightness were each equivalent 
to judgments of doubling the distance from the source. If true, this theory 
has powerful implications that would necessitate a re-examination of 


Psychophysical procedures and analysis. 


Signal Detection 


In psychophysical experiments, the subjects’ responses to identical 
stimuli vary, and this variation occurs systematically as the experiment 
proceeds. A consideration of this learning aspect in repeated psychophysi- 
cal judgments plus the observation that payoff functions are important 
in determining thresholds are the major points of departure for theories 
of signal detection (Goldiamond, 1962; Green, 1960: Swets, 1961a). 

. Signal detection theory was developed from statistical decision theory 
'n an attempt to specify optimal behavior in situations requiring a choice 
between alternative hypotheses on the basis of an observed event (Swets, 

anner, and Birdsall, 1961). Technical procedures were then developed 
lo describe the behavior of subjects attempting to discriminate patterns 
With both information and noise from those patterns which contain only 
Noise, This decision theory approach generalizes the conventional notion 
of a threshold as a fixed limit to the concept of a criterion range of accept- 
ance (Atkinson, 1963: Swets, 1901b). A further advance in this formula- 
lion over the classical psychophysical methods is the insistence that during 
the Course of an expcriment the subiect learns and that this learning will 
affect the probabilities of choice (Luce. 1963). 


Information Theory 


Since its continuing development by Fisher, Shannon, and others, in- 
?rmation theory lias had an impact on psychology which goes far beyond 
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the relatively small number of studies attempting to incorporate and to 
test certain of its consequences, . 
Fortunately, a book is available which provides an extensive frame of 
reference for considering information theory in relation to psychological 
theory and experimentation, This book, Uncertainty and Structure as 
Psychological Concepts, by Garner (1962), is the outcome of several 
years work on the uses of informatio 
an introduction to the mathematical 
problems and experiments in univari 
uncertainties. Empirical research 
placed in a theoretical context; e 


n theory in psychology. It provides 
basis of information and considers 
ate, bivariate, and multidimensional 
is reviewed in each section and is 
xamples are redundancy in language. 
tegory scaling. 

muli is related to the uncertainty of the 


led by informational analysis. In this 
connection, Alluisi (1960) reviewed recent studies of 


explicated the usages of information me 


; he extended the argument by noting 
lation matrix are related to variance and 


Multidimensional Scaling 


models assumed fixed "real" scales of prefer- 


ence or judgment, (b) that stochastic models implied that the subject 
acted as if he consulted a table of 


entary 
tuation, and (c) that certain no 
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tion of the properties of a wide variety of tasks (Shepard, 1960), Shepard 
(1962a) concluded that in each case the subjects judgment or response 
was a function of proximity among the stimuli. Shepard then considered 
the intuitive isomorphisms that are expected to hold between the psycho- 
logical idea of proximity and its mathematical representation. For example, 
the notion that A is close to B implies that B is close to A, given the 
metric requirement-that the distance between two stimuli be symmetric. 
Since it is by no means certain that proximity measures are related to 
distance measures by a single fixed rule’ for all domains, Shepard con- 
sidered the general case of multidimensional scaling where distances are 
required only to be monotonic functions of proximities. 

Thus, in Shepard's (1962a, b) formulation of multidimensional scaling 
there is no specified distance function; only a monotonic relation between 
proximity and distance is kept invariant. The objectives of the analysis 
are to determine (a) the minimum number of dimensions of a Euclidean 
Space such that distances in the space are monotonically related to the 
initial proximity measures, (b) a set of orthogonal coordinates Íor the 
Points in the minimum space, and (c) a plot showing the form of the 
relation between the initially unknown distances and the experimental 
Proximity measures. It is important to note that only two conditions, 
minimum dimensionality and invariant monotonicity, lead to a unique 
Solution. A computer program was also developed (Shepard, 1962a, b) to 
implement the numerical methods of the analysis. Finally, Shepard also 
Considered the implications of this technique for factor analysis, standard 
multidimensional scaling, and ordered metric scaling. 

A further elaboration of Shepard's work was undertaken by Kruskal 
(1963), who pointed out that Shepard's technique still lacked a solid 
Statistical foundation—that is, an explicit mathematical-statistical defini. 
tion of how well the data have been compressed under the criterion of 
Monotone invariance. Kruskal developed a theoretical rationale and prac- 
tical procedure for attacking this problem. He defined a quantitative 
Measure of fit called stress (a measure of how much the theoretical con- 
figuration differed from the raw data) and developed a ee pro- 
Bram to find the configuration that would ee ee ae 
i is program also fenus the rt of certain non-kuclide ce func- 
lons, su Minkowski r-metrics. . 

a ‘Gondated) independently considered < prahem 3 enlarg- 
ing the class of distance measures. It is of interest "n a ge c— 
TOm a set of presuppositions completely dint m DW m s, he 
Came out with essentially the procedures stated a us eit 
inhibited: from coming to a full feasible solution by the lack of high. 


Speed computer facilities. 


, Another approach to th l n 
tion from ordinal distance informatio 


Beneřalization of Coombs’ unfolding tec 


e derivation of a multidimensional configura- 
n was attempted in terms of a 
hnique from its unidimensional 
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basis to the multidimensional case (Bennett and Hays, 1960; Hays and 
Bennett, 1961; McElwain and Keats, 1961). . . 

Ekman (1963) generalized direct ratio scaling methods into a multi- 
dimensional model and suggested a procedure in which the observer 
judges the proportion of a standard percept that each stimulus percept 
contains. The model represents these judgments as projections on unit 
reference vectors, from which scalar products are derived as a basis for 
factor analysis. 


Applications 


Multidimensional Scaling has been applied in widely different areas 
during the past three years, including studies of emotions in faces (Abel. 
son and Sermat, 1962), of the perception of political leaders (Messick, 
1961), and of the social dimensions of a jury table (Strodtbeck and 
Hook, 1961). Wilson (1963) performed a multidimensional analysis of 
confusion errors (Shepard, 1957) in an attempt to identify cues used in 
the recognition of English consonants, and Keats (1962) used multidi- 
mensional unfolding to compare preferences for artists, heroes, saints, and 
scholars in Malaya and Australia. 

One difficulty in applying standard multidimensional scaling procedures 
is the enormous number of judgments required from each subject. One 
approach to reducing this large number of judgments was suggested by 
Gulliksen and Tucker (1961) for unidimensional scaling. Their proce- 
dure utilized multiple rank order judgments obtained in balanced incom- 
plete block designs as a basis for deriving paired comparison information. 


Individual Differences 


jor approaches have been followed in analyzing 
rom groups of individuals. One approach has been 
to compute group averages and to generalize the findings “on the average. 
The other procedure has been to analyze each person’s responses separately 
and to tabulate the results individual by individual. Although the first 
method is straightforward and frequently used, it is possible that the 
averaging of responses across subjects may mask consistent individual 
differences in reaction to the stimuli and thus may produce a conglomerate 
result that may not adequately Summarize the individual data (Messick. 
1961). The second method of treating each individual separately also 
presents difficulties, particularly in describing the results for groups of 
i i lé results for several individuals and 


that both isolates consistent individ 
and derives separate scales for each 
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Bennett, 1961). Models and procedures for determining dimensions of 
individual differences have been presented for unidimensional scaling by 
Tucker (1960) and Slater (1960). and an individual differences model 
for multidimensional scaling was described by Tucker and Messick (1963). 
Helm and Tucker (1962) applied this latter model to the multidimensional 
scaling of colors and isolated separate color spaces for corresponding 
individual differences in color deficiency. | 


, Computer Models 


A relatively new approach to model building using computers and com- 
puter languages has begun to bear fruit in recent years. We have seen in an 
earlier section how compuler models for multidimensional scaling with 
an unknown distance function (Kruskal, 1963; Shepard, 1962a, b) have 
made feasible the tedious numerical analyses required for a solution. It is 
hoped that similar computer formulations will also prove useful in other 
areas for the derivation of complex measurement parameters. 

ccause the computer permits a massive increase in the complexity of 
the relationships that can be manipulated and also permits dynamic inte- 
Station over time, computer models offer a powerful adjunct to mathe- 
matical models in areas involving complex cognitive and personality 
Processes (Borko, 1962: Green. 1961, 1963: Newell and Simon, 1963; 


omkins and Messick. 1963). 


Reviews and Integrative Summaries 


Developments from 1960 to 1962 in scaling. test theory, and factor 
analysis were reviewed by Tucker (1963). Recent contributions in scaling, 
Including applications in psychophysics, test theory, and the measurement 
of utility, were presented in Psychological Scaling: Theory and Applica- 
"ons (Gulliksen and Messick, 1960). Restle (1961) developed a mathe- 
tica] theory of choice and discussed its application in areas of utility 
Measurement. recognition, detection, psychophysics, and response limes, 

Luce, Bush. and Galanter (1963a) edited an impressive three-volume 
Mandbook of Mathematical Psychology, of which the first volume has 
"Ppeared, This handbook. written by and for professionals in the field, 
v and an analysis of the major mathematical 


Proyi 
! Ovides both a summary i » À 

leories in psychology. By a careful, and frequently formal and axiomatic. 
Presentation of the theoretical structures under consideration, this volume 


Serves to explicate the assumptions underlying mathematical psychology 
™ general &s well as psychological measurement in particular. The pres- 
“ntation, which is maintained at a uniformly high level. offers gn ummense 
Compression of material while keeping the essentials articulated. The 
Opics covered in the first volume include basic measurement theory, choice 
eXpe discrimination. psychophysical 


539 


"ments, detection and recognition. 


Review or EDUCATIONAL RESEARCH Vol. XXXIII, No. 5 


scaling, stochastic latency mechanisms, estimation HE MEC gua 
computers. Luce, Bush, and Galanter (1963b) have also ie dea 
volume companion book of convenient source material for the resea 

and teacher of advanced psychometrics. 


Mathematics and Psychology 


In conclusion, the writers would like to emphasize a point bearing upon 
the presumptions of mathematical psychology. Many of the existing mathe- 
matical theories of psychology have been constructed as formal systems 
and have made explicit their mathematical properties and the uer 
required for their application. However, these formulations are essentia ; 
theories of human processes cast in mathematical terms, and the role o 
theory in directing and even controlling research is well known. Limita- 
tions in current psychological views of human processes, built into the 
mathematical theory as axiom and supposition, will still be present in the 
transformed notation, along with additional constraints that are associated 
with the structure of the superimposed mathematical system. These limita- 
tions will directly influence the form that subsequent empirical research 
takes, and this influence, which is pervasive, extends from the manner in 
which data are gathered to the choice of topics deemed important. 

For formal theories—mathematical, statistical, or logical—to become 
empirical theories they must undergo a process whereby the elements and 
rules of the formal model are interpreted and given empirical meaning. 
Propositions true in a formal system by implication also need to be true 
in an empirical sense after interpretation. This most important topic of 
investigating the psychological meaning and relevance of the axioms has 


not yet received in any measure the Systematic effort so far invested in 
formulating the axioms. 
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CHAPTER VIII 


Statistical Learning Models 


RONALD A. WEITZMAN 


r 

Pur STATISTICAL models represent perhaps the newest and fastest-grow- 
ing field in psychology. It would be fatuous if not impossible to attempt 
to trace the growth of this field by means of a graph showing the number 
of articles and books published annually since 1950, when W. K. Estes 
turned the first patch of what has since proved to be very fertile soil. 
Considering together the facts that in 1950 there was only one published 
article and that in the past three years, the period under review, there 
have been so many articles that it has been difficult to select the most 
significant 40 (the editorial limit) for discussion is sufficient to give an 
idea of the pace at which the field has been developing. 

Originally, the application of statistical models in psychology was exclu- 
sively in the field of learning. Although most of the recent work has con- 
tinued to be done in this field, there has been a considerable body of work 
done in the past few years which represents an extension of the basic 
ideas of the learning models to other fields of psychology. When the 
models were confined to the study of learning, they did not have the status 
of a field. Now, however, because of the extensions to other fields, the 
models themselves must be considered a field. Indeed, a text devoted exclu- 
sively to statistical models in psychology must appear before long. 

The statistical models are a variety of the mathematical models which. 
since the days of Fechner and Ebbinghaus, have been at least somewhat 
familiar to most psychologists. What distinguishes the statistical models 
is not only that the dependent variable in their observation equations is a 
statistic (either a mean, ʻa variance, a correlation, or a proportion) but 
also that the assumptions on which they are based uniformly include at 
least one statement involving probability. However, the independent varia- 
bles do not fall so neatly into one category as do the dependent variables. 
for, in practice, they have included not only trial numbers, random varia- 
bles representing correct and incorrect responses, and proportions of 
reinforcements (in partial-reinforcement experiments) but also unfitted 
parameters. 

Recently, the development of procedures for evaluating theoretically a 
variety of statistics has been of major concern to modelists, Earlier, interest 
had been confined mainly to the theoretical proportion of correct responses 
made by a group of subjects on each trial and especially on each of the 
later (asymptotic) trials of a learning experiment. Since these statistics. 
however, have failed to discriminate among the numerous learning models 


543 


Review or EDUCATIONAL RESEARCH Vol. XXXIII, No. E 


which have been presented, other statistics have been sought and eset 
ered. Among these are mean numbers of runs of correct (or mon 
responses of various lengths, the proportion of correct (or weed 
responses following a reinforced (or nonreinforced) trial, the correla ^ 
between random variables representing responses separated by a certair 
number of trials, and the variance of the total number of correct (or meor: 
rect) responses made by all the subjects in an experiment. Some of man 
statistics can be applied to individual subjects as well as to groups o 
subjects, and some have even been used to test for individual differences. 
When a number of theoretically evaluated statistics are available for 
several models, comparisons of these statistics with those computed directly 
from data afford a means for discriminating among the models and also 
for assessing the strengths and weaknesses o 
Another recent development h 
statistical nature. Most psychol 
Mosteller models which are der 
are psychological only in application, the Este: 
matical formulations and derivations of b 


Í each. 
as been of a psychological as well as F 
ogists know that, whereas the Bush and 


s models represent mathe- 
asically psychological ideas. These 
a Guthrian-Hullian nature. In the 
the Estes group of modelists has moved 
m the Hullian to the Guthrian position. 
(Restle) has become frankly cognitive, 
ct renounced previous theoretical posi- 


l arguments (psychological and mathe- 
matical) and empirical results appear to have heen responsible for the 


In addition to the statistic 
and Mosteller, certain nonst. 


time to time. Some of these 
based; among 


al models furnished by Estes and by Bush 
atistical learning models have appeared from 
are empirically based, and some, theoretically 
the theoretically based, some are cognitive, and some are 
Tesponsestrength models, One of these models will be discussed briefly 
in the latter part of this chapter. The major portion of this chapter, 


however, will deal with alterations, empirical tests, and extensions of the 
Estes and Bush-Mosteller models, 


Alterations 
The Pattern Model 
In a series of papers dating 


been brought to bear once aga 
issue which Krechevsky and S 


from a paper by Rock in 1957, evidence e 
in on the old continuity-versus-noncontinu?y 

pence in the 1930's and early 1940’s brough 
so forcefully to the attention of Psychologists. Statistical learning theorists 
have become interested in the resurrection of this apparently resolved (in 
favor of the continuity position) issue, and an important part of their 
544 
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motivation has been theoretical. The original Estes models, though based 
on the assumption that learning occurs in an all-or-none (noncontinuous) 
manner, had been able to account for the appearance of continuous learn- 
ing by a device similar to that which Guthrie had used earlier in similar 
circumstances: an S-R connection is formed or is not formed on an experi- 
mental trial, but many such connections may be necessary to produce the 
response that the experimenter considers correct. In fact, Estes had identi- 
fied the probability of a correct response on an experimental trial with the 
proportion of stimulus elements conditioned to that response on that trial. 
Many varieties of model based on this fundamental identification had been 
devised, but all suffered from the same theoretical weakness: they yielded 
explicit formulas for very few statistics (an embarrassing situation for 
à statistical model). . . 

In 1959, Estes introduced the pattern model to psychological literature 
and indicated that this model had the power of producing explicit formulas 
for many statistics. Bower (1961) proved this when he presented explicit 
formulas for 22 different statistics derived from the pattern model! The 
pattern model, however, precludes the former identification of proportion 
of conditioned stimulus elements with probability of a response because 
it is based on the assumption that only one stimulus element pum is 
present on each experimental trial. The gauntlet has been taron: ither 
the pattern model is wrong, or learning is of a demonstrably all-or-none 


nature. 

Recent evidence in favor of allor-non 
forms. One of the most interesting is t 
(Estes, 1960) —an experiment consisting our f 
of which only the first involves reinforcement (the remaining trials being 
“test” trials). Results of such experiments favoring the noncontinuity view 
were reported by Estes (1960, 1961) and by Estes, Hopkins, and Crothers 
(1960). Jones (1962) described results of S = kind of experiment 
whi 7 n ly partly favorable to this view. . 
bp pieces dme : er another line of evidence. 


igni i ms to be developing ov 
Bose eri aea ez) and Trabaseo (1963) showed that the average number 


i i fore the last 
(or proporti of errors remains constant over trials be 
ma Lesen trials). These results are vo n bea fa 
noncontinuity view. However, after presenting simi a aus b ippes 
and Ginsberg (1963) indicated that when learning oe a 
for individual differences by Vincentizing, the propor of p tio 
Trabasso (1963), meanwhile, 


i i tant. 
period errors no longer remained constant 
found that tien iow a similar correction by the method of backwards 


B 
earning curves, this proportion still remained € Perhaps Tineo s 
Subjects wcre more homogeneous than were those E x awe be erg. 
Who included not only adults but also children - ES in s ely i 
Who explicitly presented evidence for individual ditte g 


i / lt stands against that of 
child ts. Even so, Trabasso’s resu a 

Suppec and dan "ind the cause of the discrepancy remains to be found. 
545 


e learning has come in a variety of 
he so-called miniature experiment 
of just three, four, or five trials, 
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Additional References: Peterson and others (1962) : Reynolds (1963): 
Simon (1962) : Waugh and Smith (1962) : Williams (1962). 


A Generalization of the Pattern Model 


The pattern model, which involves only one parameter, was generalized 
by Suppes and Atkinson (1960) to allow for a number of parameters. 
In the original simple pattern model, the single parameter represents the 
probability that a reinforcemerit on any trial will be effective. In the 
generalized version of this model, a different parameter is possible for 
each response-reinforcement situation. For example, Suppes and Schlag- 
Rey (1962) presented, among other generalized pattern models, a model 
that involves, for trials following two reinforcements, a parameter that 
differs from the themselves-differing parameters for trials following only 
one or zero reinforcements, respectively, on the preceding two trials. 


An Incremental Model 


Atkinson (1961) marched off briefly in the opposite direction and sug- 
gested an incremental model wherein a stimulus element may be in one of 
a finite number of conditioning states on any experimental trial. The maxi- 
mum number of states possible for an element in this model is assumed to be 
a function of motivation or reinforcement, and the probability of a correct 
response on any experimental trial is identified with the ratio of the state 
number of the stimulus element present on that trail to the highest state 
number possible for that element. Application of a single-stimulus version 
of this model to data involving different numbers of possible responses 
has shown that the maximum number of states varies with the number of 


possible responses and so is not. as originally hoped, simply a function 
of motivation or reinforcement. 


A Strategy Model 


Restle (1961, 1962) presented a strategy model which represents a 
mathematical rendition of the cognitive version of the noncontinuity view 
which Krechevsky presented a quarter of a century earlier. This model 
resembles the simple pattern model in that one clement is assumed to be 
sampled on any experimental trial 


and in that if this element is aa’ 
conditioned a correct response will occur. [It should be noted that Restle 
(1962) showed that the model is 


. e 
unchanged if the assumption of ie 
element is relaxed.] Important differences between this and the patter 


model, however, are (a) the interpretation of the elements available = 
sampling, which here are strategies (hypotheses) rather than stimulu 


vhich 
patterns, and (b) the number of parameters, which here are three and whit 
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represent the proportions. respectively, of correct, wrong, and irrelevant 
ier qni cap en a partolar experimental situation. On any experi- 
al, a subject is assumed to sample a strategy randomly from 

the population of possible strategies: if he samples one of the correct 
strategies, he will respond correctly; if he samples one of the wrong 
strategies, he will respond incorrectly; and if he samples one of the irrele- 
vant strategies, he may or may not (by chance) respond correctly. More- 
over, once a correct strategy has been sampled, and the response made in 
accordance with it is reinforced, the subject will continue to act only on the 
basis of that strategy. Restle (1962) maintained that this model is superior 
to his previous adaptation model on logical grounds; he demonstrated 
that in the concept identification experiments where the older model had 
been applied, it can account for the older model's successes and perhaps 
succeed where the older model failed. 
The successes of the adaptation model have usually involved the prin- 
ciple of additivity of cues. Trabasso (1960), for example, showed that in 
using the sum of parameters estimated separately in two experiments, each 
involving a single relevant cue, he could successfully predict (by way of 
Restle's adaptation model) the number of errors made in an experiment 
in which both these (and only these) cues were relevant. An example of 


the failures was reported by Levine (1962), who showed that prolonged 
random reinforcement during pretraining, which according to the adapta- 
all cues presented, does 


tion model should result in complete adaptation of 
not have this effect; in fact, subjects are able to identify concepts correctly 


on the basis of cues which should have been rendered functionless (which 
should have been adapted out) during previous random reinforcement of 


them. 
Additional References: Bourne and Bunderson (1963) ; Estes and Hop- 
kins (1961) ; Pishkin (1960) : Richards (1962). 


Empirical Tests 


Generally speaking, empirical tests of the models have been rather hard 
on them. This situation exists despite the fact that the immediate impression 


received on reading any single major article is usually the contrary one: 
namely, that the model fits the data quite well. The reason for this discrep- 
ancy seems to be that à model's originator reports (honestly) favorable 
results, whereas others only later adduce negative evidence. The time lag 
is such, however, that at any given moment it has seemed that at least one 


modelist has indeed cracked the code" of the data. 


The Matching Rule 
(1961) showed that a variety of models can predict 


Feldman and Newell 
the matching of the asymptotic proportion of a response with the propor- 
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i i i en reinforced in the course of an experiment and 
cwm pic y a ne or nonoccurrence of matching is not 
ee exclusively in favor of any one model. The evidence regarding 
ws appearance of matching, moreover, has always been mixed. ai 

Additional References: Bourne (1963); Chapman (1961); Der > 
(1962); Greeno (1962); Hickson (1961); McCracken, Osterhout, an 
Voss (1962) ; Nies (1962) ; Shaffer (1962, 1963) ; Witte (1961). 


Conditional Dependencies 


Of continued interest is the proportion of correct (or incorrect) responses 
on a trial following the occurrence of reinforcement (or nonreinforcement) 
on the preceding trial or trials (event dependencies) or following the 
occurrence of a correct (or incorrect) response on the preceding trial or 
trials (response dependencies) . Classifying dependencies not only as event 
and response but also as absolute (for example, in the case of an absolute 
event, dependency there being zero, one, or two reinforcements on the 
preceding two trials) and relative (for example, in the case of a relative 
event, dependency there being two possible perseverations—reinforce- 
ment-reinforcement and nonreinforcement-nonreinforcement—and two 
possible alternations of reinforcements and nonreinforcements on the pre- 
ceding two trials—reinforcement-nonreinforcement and nonreinforcement- 
reinforcement), Suppes and Schlag-Rey (1962) 
models to analyze the asymptotic data of 


They concluded that response dependencies are more effective than event 


dependencies and that relative dependencies are more effective than 
absolute dependencies. 


Additional References: Anderson 
(1961) ; Derks (1963) ; Edw 


used generalized pattern 
a two-choice learning experiment. 


and Whalen (1960); Chapman 
ards (1961) ; Prokasy (1961) ; Witte (1961). 


A Variety of Statistics 
On a number of Occasions, 


sometimes to evaluate a single 
pare the relative fits to à singl 


a variety of statistics has been presented, 
model against data and sometimes to gom 

e set of data of two or more models. E 
not all models readily yield a large number of statistics in the form o 
explicit formulas, resort has occasionally been made to Monte Carlo pre 
cedures for computing statistics. Flood (1962) recommended and illus- 
trated the use of Monte Carlo procedures on a high-speed computer as 4 
general method for testing and comparing models. Bower (1961!, however, 
was able (as noted earlier) to derive explicit formulas for 22 statistics 
Írom the assumptions of the pattern model and, in illustration of their 
use, showed that 17 out of 19 of them fitted the data of a paired-associates 
experiment with college student Subjects better than the corresponding 
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statistics of a Bush-Mosteller model (the only other model for which such 
a large number of statistics had been derived in explicit form). Modelists 
consider that the power of a model to yield explicit formulas for statistics 
is an important property, and on the basis of such a criterion the pattern 
model must be rated very high indeed. 

What appears to be the greatest success of any statistical learning model 
in accounting for the data of an experiment was reported by Bower (1962) 
for the pattern model. From the data of a paired-associates learning ex- 
periment involving a number of groups of college students, 24 statistics 
were computed, each by theoretical and einpirical formulas. Although the 
same parameter value was used in each of the theoretical formulas, all 24. 
comparisons were numerically close. 

These results of Bower's are so startling that they immediately occasion 
this question: Is there not some artifact or special condition which accounts 
for such good results? There does appear to be one. In the same paper, 
Bower reported that the variance of total errors for at least some of the 
groups was so low that it precluded the possibility of individual differences 
in parameter value. Since the single parameter of the pattern model repre- 
sents learning rate. this result gives rise to the suspicion that the tasks for 
the various groups were uniformly so easy that such differences in intelli- 
gence as might exist among the homogeneous population of college students 
did not extend downward sufficiently to be reflected in differences in the 


parameter value. . 

This suspicion has been confirmed by Suppes and Ginsberg (1963), 
Who, in applying the same model to the data of concept-formation experi- 
ments with children, found not only that the children's and college students 
parameter values differed but also that there were individual differences 
among the parameter values of the children themselves. Had Bower 


carried out his experiment with children or under conditions of partial 
rather than continuous reinforcement, almost certainly his results would 
not have been so good. The generality of Bower’s results is thus ques- 
tionable, and at present there seems to be no way of testing the pattern 
or any other model so extensively under conditions wherein individual 


differences are expected to obtain. 


Extensions 


al learning models have been used in models which 
henomena not only outside the field of learning 
the field of learning—areas in which the older 


Basic ideas of statistic 
describe psychological p! 
but also in areas within 
models do not directiy apply- t 


Social Psychology 


Suppes and Atkinson (1960) made an important extension of the pattern 
mode* to describe the behavior that occurs in a number of versions of 
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games for two or more persons. These games are ni ede et T 
actions, many varieties of which have been studied extensively y econo i 
game theorists, who suppose that in such situations human beings n 
behave in a strictly rational manner. Psychologists have long thought ai 
supposition generally incorrect; and in the experiments reported by Suppes 
and Atkinson, the models, which do not require strictly rational behavior. 
are shown to describe the data well. "MD 

The conceptual device used to effect this extensior is interesting in its 
own right. Statistical learning models have lent themselves particularly 
well to the study of partial-reiuforcement situations in which the proba- 
bility of reinforcement is contingent on the subjects choice. In a two- 
person game, this contingency is broadened to include the choice of both 
subjects, and it is not difficult to alter a model which applies in the one 
case so that it will apply in the other as well. 


Additional References: Cervin and Henderson (1961); Suppes and 
Krasne (1961). 


Concept Formation in Children 


Suppes and Ginsberg (1962, 1963) 
to study concept formation in children. 
they applied the pattern model in an e 
to learn to discriminate between the 
successively and expressed in binary 


symbolic expressions. (For example, the symbolic expressions aAA an 

ada, respectively, represent in binary notation what in common notation 
are represented by the numerals 4 and 5.) They found that children may 
learn to discriminate between such expressions by identifying each one 
separately in terms of the particular combination of symbols in it rather 
than by assigning it to a class, the members of which share a common 
property. They also found that, different from adults in similar situations. 
children seem to learn faster—by showing appropriately different value= 
of the parameter, which represents the effectiveness of each reinforcemen! 


on learning—when they are allowed to correct their mistakes than when 
they are only shown them, 


set out in an ambitious program 
In their 1962 study, for example. 
xperiment in which children were 
numbers four and five presented 
notation by means of a variety of 


Response Magnitude 


much less attention than it deserves, In the period under review, the most 
important attempts to remedy this Situation were made by Suppes ay 
Frankmann (1961), Suppes and Zinnes (1961), and Anderson (1961)- 
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The purpose of these studies was not to describe response latencies (de- 
scriptions of which have on rare enough occasions been attempted), but 
to describe response magnitudes. In the experiments by Suppes and 
Frankmann, for example, subjects were required to indicate where on the 
circumference of a circle a spot of light would subsequently fall, and in 
those experiments which Anderson’s models are intended to describe. 
subjects were required to guess lengths of lines. 

One of the results of the experiments by Suppes and Frankmann is of 
special interest. In these experiments, it was arranged that on every trial 
a light would fall on a point in an arc of fixed length in such a way that 
the probability that it would fall on any particular point in the arc was 
proportional to the height above that point of a triangle which could be 
constructed over the arc as its base. The shape of the histogram describing 
the asymptotic predictions of the subjects, however, was not triangular, 
This result is contrary to the matching rule which requires that reinforce- 
ment and response probabilities be asymptotically equal. 


Reaction Time and Other Topics 


LaBerge (1962) presented a model based on stimulus sampling ideas 
which is intended to account for results of reaction-time, simple-discrimina- 
tion, and signal-detection experiments. The basic ideas of this model are 
that (a) on any experimental trial there is a constant proportion of stimuli 
which, when a sufficient number of them have been sampled, will lead to 
à recordable or correct response, (b) each stimulus is sampled randomly 
and one at a time from the entire population of stimuli present on the 
trial, and (c) each successive sampling requires the same amount of time. 
LaBerge showed the model to be consistent with both reaction-time and 
signal-detection data reported elsewhere in the literature. 

Additional References: Atkinson (1961, 1963) : Carterette (1961) ; Luce 
(1963) ; Shaffer (1962). 


Other Models 


McGuire (1961) presented a three-process, response-strength model to 
describe the data of paired-associate learning experiments. The three proc- 
'sses involve the development of connections between (a) external stimuli 
and internal responses, (b) internal stimuli (scimulus properties of the 
internal responses) and external responses, and (c) external stimulus- 
response pairs. As indicated by Bower (1962), a model such as this one 
of McGuire’s may be applied to paired-associate learning experiments 
where not only the connections between the stimulus and response elements 
but also the response elements themselves are to be learned. In the experi- 
ments in which Bower's model has shown great success, the responses were 
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not, like the stimuli, nonsense syllables, but were numbers, obviously 

sady. quite: familiar to the subjects. M 
uror don References: Feigenbaum and Simon (1961. 1962); Rey 
(1961); Kanal (1962a, b); Lamperti and Suppes (1960); Levine 
(1963) ; Murdock (1960) ; Pubols (1962). 


Prospectus 


The immediate future should.embrace not only more tests, alterations, 
and extensions of extant models but also the creacion of new ones. b 
pattern and strategy models represent the latest signposts along a new an 
adventuresome road. The pattern model, which has given rise to so many 
statistics and which has been applied with such success and utility not 
only in the field of learning but also in other fields, stands as the greatest 
single achievement of the period under review. Plenty of room for ex- 
tension and improvement remains, however, and can be scen from a con- 
sideration of the following facts: (a) the limited applicability of current 
models, which describe only behavior involving a choice between two or 
more responses; (b) the unreasonable assumption, relaxed only in a model 
which LaBerge introduced in 1959, that stimuli are conditioned exclusively 
to responses in which the experimenter is interested (see LaBerge, Greeno, 
and Peterson, 1962) ; (c) the absence of adequately testable models which 
describe behavior wherein there are individual differences; (d) the lack 
of knowledge of the sampling distributions of almost all the statistics 
derived from extant models—knowledge which is necessary for tests of 
goodness of fit; and (e) the treatment of learning in such a mechanical 
fashion that the greatest success of existing models has been and can only 


be expected to continue to be in situations so devoid of emotional involve- 
ment and intellectual challenge that all they require of adult human 
subjects is robot behavior. 


Although the field is open 
is already in the air a humr 
which, compared with curr 
stronger but also psychologic 


and the soil has hardly been broken, there 
ming that betokens the building of models 
ent models, will be not only statistically 
ally sounder and more widely applicable. 
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CHAPTER IX 


Decision Theory and Utility Management 


ROBERT W. EARL 


Since Girshick presented his elementary survey. of statistical decision 
theory in the December 1954. issue of the Review, a tremendous amount 
of research has been published by behavioral scientists in journals that 
fall outside the focus of most workers in educational research. The new 
mathematical psychologists who have made important methodological 
contributions to the understanding of problems in decision theory and 
utility measurement have not been very much concerned with problems of 
immediate or practical interest to the educati 
riculum specialist, the counselor, 
of the methodological problems 


onal psychologist, the cur- 
or the school administrator, Once many 
have been resolved and a substantial 
amount of experimental work has been done to ascertain the suitability of 
the somewhat esoteric models in decision theory to the understanding 
and to the potential solution of certain educational problems, there will 
probably be little use of decision theory by research specialists in educa- 
tion. In the writer's estimation, at least 10 or 15 years will elapse before 
any marked application of decision theory will be made to the types of 
educational problems faced by most educational researchers. . 
In view of the substantial overlap of the field of decision theory with 
(a) the problems of statistical inference, considered hy Clark in Chapter I 
and by Michael in Chapter IT; (b) the problems in psychophysics. signal 
detection, and information theory as well as general developments in scal- 
ing and measurement theory, presented by Schiffman and Messick ne 
Chapter VII; (c) statistical learning models, detailed by Weitzman 1” 
Chapter VIII; and (d) computer technology, as described by Baker 
in Chapter X, the writer decided with somewhat mixed feelings that his 
presentation should be drastically curtailed in length and in scope | 
tive to what he had initially planned to undertake. Nevertheless, a detaile 
survey was made of pertinent literature since. about 1950. For the most 
part, references since 1954 have been included. Probably the most im- 
portant contribution of this chapter will be to furnish the reader with a 
bibliography of slightly more than 100 carefully selected references which 
furnish what may be considered an appropriate base for the Aequi 
of an understanding of decision theory and utility management in the 
behavioral sciences. These references were selected on the basis of nk 
sive consultation with severa] mathematica] psychologists who have specia- 
ized in the study of decision theory, 
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General References 


The literature of decision theory varies considerably in the mathematical 
sophistication it demands of its readers. The interested reader can study 
the theory of decision in considerable depth by consulting the following 
references. which, although differing in levels of mathematical maturity 
required for their comprehension, tend to be general in treatment: 
Adams (1960), Bakan (1953). Bross (1953), Chernoff and Moses (1959). 
Edwards (1954, 196!, 1962a, b. c). Edwards, Lindman, and Savage 
(1963). Raiffa and Schlaifer (1961), Savage (1954), Schlaifer (1959). 
Von Neumann and Morgenstern (1917), and Wald (1947, 1950). 


Some Basic Considerations 


Recent books and articles on “statistical decision” theory contain a 
number of remarks at variance with the fiat and scripture that currently 
dictate the practice of statistical generalization in scientific research and 
industrial quality control. For example, Schlaifer (1959, p. 629), in a 


moment of restraint, stated that traditional hypothesis testing, which in 
leads to absurd and dangerous con- 


his opinion is worse than useless, 1 
) stated that it is not unusual 


clusions. Similarly, Savage (1962, p. 181 
for a t value of three with many degrees of freedom to support the null 


hypothesis. Raiffa and Schlaifer (1961, p. 41) stated that an experimenter 
» ; 
may terminate an experiment for “whatever reason, provided that the 


manner of the data analysis is Bayesian. For stopping an experiment, 
Edwards, Lindman, and Savage (1963) listed such possible “reasons 
whatever” as being tired, answering the telephone, running out of money, 
feeling one has sufficient data to prove a point, but not necessarily being 
chased away by lions. 

In his most recent joust with folklore, Binder (1963) took exception 
to Grant's (1962) argument that rejection of the null hypothesis can sup- 
port an empirical hypothesis while acceptance of the null hypothesis 
cannot. Grant’s point of view possibly derived from the parallel suggestion 
of Francis Bacon that scientific induction is an asymmetrical process under 
which enumeration of confirming instances is "exposed to danger from one 
contradictory instance" while reliance on negative instances to eliminate 
false hypotheses is not. . 

As the first psychologist since Lewin and his associates (Cartwright, 
1941; Festinger, 1942, 1943) to become inseparably identified with deci. 
sion theory, Edwards (1961) reacted to recent developments in mathe- 
matical statistics by issuing a call for someone to lead psychologists out 
of the wilderness of ttests and F-tests. Failing to elicit a response to his 
call, he answered the call himself (Edwards, Lindman, and Savage, 1963). 

Although the authors of the preceding statements refer to hypothesis 
testing and traditional tests of significance, they are actually arguing 
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within the slightly vorm puis of ethics, aesthetics, logic 

; gy, ampliative induction, 
r. w— amp theory, and behavior theory, whether of man, manse: 
or machine, to give but a small sample of the topics upon which their 
theories bear. Nominally, the articles cited in the bibliography in this 
chapter are largely directed toward the study of (a) optimal gambling 
behavior; (b) optimal economic decision making; (c) optimal human. 
animal, or machine decision making, regardless of context: (d) common 
day-to-day, infrequently optimal decision-making behavior of human be- 
ings or animals; (e) human data processing; and (f) classical versus 
Bayesian statistical inference. By contrast with the breadth of the subject 
matter it entails, the theory is quite simply called decision theory. 


, metaphysics, 
inconclusive argument, statistical 


Central Concepts, Questions, and Difficulties 
The origins of decision theory can 
attempts of early Renaissance mathematic 
in connection with commercial insurance a 
conceptualization of the problem of decisi 
with the work of Laplace. The major con 


be traced to the unsuccessful 
ians to answer questions raised 
nd gambling. The first successful 
on making came later, primarily 
cepts developed during the eight- 
nd expectation, Attempts to define 
eryone are still in progress and 
eory. 
hich is ordinarily used both as a term 
S a synonym for the relative frequency 
e events of the real world, is a source of 
add to the confusion, psychologists such 
as Festinger (1942), Atkinson (1957), and Edwards (1962b) have 
itions of Probability to those mentioned above. 
mathematicians, statisticians, and economists had 
of credibility or credence in lieu of the objective 
henever the objective approach created "logical 
difficulties, 


Probability 


In the mathematical f 


orinulation, probabili 
term that derives its m 


ly is treated as an undefined 
eaning indirect! 


y from the way that the term is 

ules of the mathematical system. 
Parenthetically, the Concept of randomness js treated similarly in the 
system. Coombs, Raiffa, and Thrall (1954) presented a very clear and 
understandable discussion of the distinction between probability as an 
abstract mathematical entity and pro 


bability defined with reference to 
experimental events. Probability, as the undefined component of a mathe- 
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matical system, is no problem in the present context. The problem faced 
by decision theorists is choosing an experimental definition of probability 
that will do what they want it to do and then sticking to it. ! 

The problem of the reader is even more difficult than that of the theorist. 
The reader must discover the theorist’s definition. if this is possible, and 
then must make sure that the theorist adheres to it. Since the reader is 
ossible, he must catch the theorist each time he shifts 
and finally must determine how many different defini- 
Experience suggests that the 
is field at least, have more 
from the next 


demanding the imp 
to a new definition 
tions were used and precisely what they were. 
burden is on the reader, for the theorists, in th 
important problems than to satisfy the reader, as one may see 
few paragraphs. 


Mathematical Expectation 


Mathematical expectation is the second major concept in decision theory. 
Mathematical expectation, or expected value, is simply a synonym for the 
mean, or average. For example, the scores of three students on a final 
examination are 2, 5, and 8; the expected score is 5. This profound con- 
cept, and it is exactly that, is symbolized E(x) =5, where x represents a 
score, The importance of this concept to decision theory is that a decision 
maker faced with the necessity of choosing between two or more alterna- 
tives will choose the alternative associated with the maximum (or mini- 
mum) expected value. Given a choice between teaching a class with 
E(x) =5, E(x) 26. E(x) =7, a decision maker would choose the latter if 
he valued the scores in proportion to their magnitude. If he valued low 
scores more than high scores, however, the decision maker would choose 
E(x) =5. This example merely illustrates the importance of the concept of 
expectation to decision theory; it does not justify the decision. 

With no special provocation, one could argue that the environment that 
gave rise to decision theory has determined, or more precisely has re- 
stricted, the selection of the variables upon which the theory is based. The 
probabilities obviously derive from games of chance; the scores, or x 
values, are simply amounts of money (e... dollars, rupees); and the ex- 
pected values. F(x). are simply the amounts of money one is likely to win 
or lose per play in the long run. The decision criterion is to bet on the 
alternative that, on the average, will pay the most (or will lose the least, 
if winning is impossible and one is forced to play). The rebuttal to this 
argument is that in order to offer more than “good” advice to gamblers 
(whatever form the gambling activity takes) and to apply this conceptuali- 
zation to areas of scientific interest, the decision theorist has had to modify 
his theory in order to come to terms with reality. Thus, he has freed the 
theory from the original restrictions. For example, theory was applied to 
the prediction of human decision making in gambling situations as opposed 
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to the usual normative application of telling the gambler how he ot 
behave. Since the results were disastrous, the theory was modified in the 
simplest way possible. The axioms were not touched, but the — 
between the abstract concepts and the real world events were modified. 
with the result that to date (a) probability refers to the subject’s expressed 
impression of the chances that a given outcome, such as heads, will occur; 
(b) the value x now indicates the subject’s expressed. preference for the 
outcome relative to the others; and (c) the expected value is simply a pref- 
erence for an act of choice deterrained by the same formula by which one 
would compute the mean score of the students in a class, 


Conclusion 


Three comments should be made. First, 
theory is still normative in that it predicts that a decision maker will be- 
have optimally in terms of his subjective interpretation of the pleasures 
and chances in his environment, Second, the theory is applied with com- 
plete equanimity—on those rare occasions when it is applied—to any kind 
of decision maker worth imagining (the unworthy imaginables are eX- 
cluded) ; all living creatures and most machines are included. Third, the 


ans and of mathematically oriented 
preoccupation is the development of 
ms that can be deduced from them. 
€ identified with experimental events. 


‘t, a theory; and a theory can be tested 
against the phenomena it is supposed to describe, If the theory fails often 
enough, the theorist can modify it by changing the axiomatic basis or the 
identifications, or he can replace the model. 

Now the key point sts of decision theory against 
data are virtually nonexi 


in its revised form, decision 


called utilities in decision 
rank ordering of the x y, 


€ axiom systems have come about 
theorists to make an original con- 
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CHAPTER X 


Use of Computers in Educational Research 


FRANK B. BAKER 


S uox Martin and Hall wrote their chapter concerning data processing 
for the December 1960 Review. the computer field has been kaleidoscopic. 
with new developments appearing and older trends growing to new levels. 
Perhaps the most marked change has been in the equipment field. In the 
previous Review period, the basic computer was the IBM 650, although a 
few major universities either had built their own large-sc 
had acquired a large-scale machine returned to the m 
aerospace industry. A significant change in this pattern h 
as a result of the convergence of four factors: 
transistorized computer, (b) change in the fiscal point of view within 
university administrations, (c) substantial educational discounts (up to 
60 percent) from the manufacturers, and (d) significant financial support 
from the National Science Foundation. Because of these four factors, most 
major universities have installed the newest and most powerful computers 
available, with Control Data Corporation 1604/s and IBM 7090's dominat- 
ing the large machine field. The holder of the unofficial title of the most 
powerful computer on a college campus has changed many times during 
the period under review, The current title holder is the Control Data 3600 
at Michigan State University. As of the most recent survey, there were 
299 computers either installed or on order for colleges and universities 
(Datamation, 1962). Unfortunately, many of the professional journals 
have not reflected this change in equipment and have continued to publish 
far too many prograr 


E ms for IBM 650s and similar obsolete machines. 
Accompanying the trend to the lar; 


ale computers or 
anufacturer by the 
as arisen, primarily 
(a) availability of the 


ge-scale “secord generation" computers 
has been a parallel trend to such small, less expensive computers as the 
IBM 1620 and the Control Data Corporation 160-A. The wide distribution 
and relatively low Operating costs of these small computers have enabled 
large numbers of persons to obtain firsthand experience in the use of 
computers. As a result, a market now exists for textbooks written around 
the design, operation, and programming of a particular computer. The text 
by Leeson and Dimitry (1962) for the IBM 1620 is an excellent example of 


this type of book, and similar efforts based upon the popular large-scale 
computers are certainly in order. 


A few years ago, a computer specialist speaking at an educational con- 


ference devoted the major portion of his presentation to explaining the 


characteristics of computers and a minor portion to discussing the relevant 
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research. Since the expanding availability of computers and the prominence 
of their role in research have resulted in an increased level of computer 
sophistication on the part of the audience. this introductory phase is no 
longer necessary. The rise in the level of sophistication of the audience is 
best exemplified by the increasing number of educational and psychological 
journals which publish computer-oriented research. Because of the shift 
in the publication pattern, this writer has attempted to restrict the present 
chapter to sources easily available to the educational researcher. Therefore, 
neither articles in the field of engineering nor mimeographed research 
reports are included. However, the omission does pose some limitations, 
as many new developments first appear in these sources. 


Developments in Computer Usage 


Karly in the period under review. the educational researcher encountered 
considerable difficulty in locating adequate generalized programs for the 
classical statistical analyses, such as analysis of variance, multiple regres- 
sion, and factor analysis. At the present time. this problem is essentially 
nonexistent—at least with respect to the popular computers. The excellent 
BIMED series of statistical programs reported by Dixon (1962) has been 
adopted at a number of installations. In addition. Leone and others (1961) 
published an extensive listing of the statistical programs available for 
many different computers. 

Since the need for fundamental statistical library programs has been 
met for the most part, the current programming effort has tended to be 
aimed at cither improving existing techniques or extending the range of 
Statistical programming. For example, in the former case, Walsh (1961) 
developed a technique for handling missing data in multiple regression, 
and Eyman, Dingman, and Meyers (1962) compared several of the tech- 
niques available for the rotation problem of factor analysis. In the latter 
Case, Cooley and Lohnes (1962) published a book of formula translation 
(FORTRAN) programs for a number of techniques in multivariate analysis 
Which are of particular interest to behavioral scientists. 

With the basic statistical programming needs met. some effort has been 
devoted to programs designed to perform analyses peculiar to education 
and psychology. For example, Baker (1963) developed a generalized item 
and test analysis program that scores each test paper consisting of multiple 
choice items and yields the following information: Hoyts coefficient of 
reliability based on analysis of variance procedure, frequency distribution 
of test Scores, summary statistics of the sample. and complete item analysis 
of all item response choices. In a clinical setting, Rome (1962) automated 
the Scoring of answer sheets of the Minnesota Multiphasic Personality 
Inventory (MMPI). Kleinmuntz and Alexander (1962) devised a program 
which’ performs a diagnosis of MMPI profiles based upon the Mechl- 
Dahlstrom approach. 
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Noncomputational Uses of Computers 


Although just emerging from infancy, the noncomputational uses of 
computers are assuming a significant role in computer-oriented research. 
Noncomputational usage is not clearly defined at present: however, what is 
suggested by such terms as simulation, pattern recognition, information 
retrieval, and teaching machines would encompass the major aspects of 
noncomputational applications. The expanse of only a part of the field 
could be demonstrated by a bibliography of articles on simulation of 
cognitive processes which itself would readily exceed 500 entries. 

The majority of published research in psychological simulation has 
employed the fifth version of an information-processing language approach 
known as the IPL-V programming system. Reported in a volume by Newell 
and others (1961), this procedure has been the outgrowth of the pioneering 
work of Newell, Shaw, and Simon (1958). This interpretative programming 
system, which manipulates lists of symbols rather than numbers, represents 
a considerable departure from ordinary scientific computer programming. 
This kind of system is, however, ideally suited for simulation. 


Simulation in Learning, Problem Solving, and 
Concept Attainment 


The Elementary Perceiving and Memorizing (EPAM) program used by 
Feigenbaum and Simon (1963) is an IPL.V program designed to perform 
roteememory tasks such as memorizing lists of nonsense syllables—a pro- 
gram using the same mechanism as employed by human subjects. Therefore, 
terms such as learning and memorizing, customarily employed to describe 
activities of human subjects, will be taken to represent the humanlike 
processes achieved by programs for computers used in simulation problems. 
The EPAM program depends upon a small amount of initial information 


by which the symbols are paired; during the siinulated experiment, this 
linkage is strengthened until the list is learned to some criterion of per- 
formance. In other experiments, the EPAM program has learned to pal! 
symbols representing the sound of an object with tke properly spelled name 
of the object. Tt should be noted, however, that despite the authors’ rather 
lengthy discussion of EPAM’s ability to match "auditory" and “visual 
stimuli to the proper object, the computer program involved does treat 
these lists of symbols as if they were just another list of symbols to be 
matched. Representation by the lists of both the proper spelling of the 
object’s name and a phonetic spelling is immaterial to the program. 
Laughery and Gregg (1962) used IPL.V to simulate the problem-solving 
behavior of a single human subject who was given as a task the determina- 
tion of the pattern in which four switches were being set. After considerable 
study of the “thinking aloud” protocols and the problem-solving behcviors 
of the subject, an IPL-V program was written to simulate the particular 
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person. Both the human subject and the IPL-V program were then pre- 
sented with an additional switch-setting problem. The subject and the IPL-V 
program generated protocols and problem-solving behaviors which ex- 
hibited a high degree of similarity. Laughery and Gregg’s article contained 
an explicit account of the painstaking process of developing a simulation 
program. 

From a programming point of view, the simulation of concept attainment 
is very similar to the type of problem programmed by Laughery and Gregg. 
Several IPL-V programs have been written for the simulation of concept 
attainment. Hovland and Hunt's (1960) first program restricted the diffi- 
culty of the concept attainment task by providing the subject with a con- 
siderable amount of information about the stimuli presented as well as 
about the concept to be learned. In a second version of the program, the 
subject had the ability to pay selective attention to certain aspects of the 
stimulus; in addition, the number of attributes employed in the hypothesis 
could be varied if additional attributes were needed to attain the concept. 
The article by Hovland and Hunt, which appears to be one of the earliest 
presentations of a computer program for concept attainment, would have 
been greatly enhanced by a program flow chart and by presentation in 
detail of some actual concept attainment runs. On the basis of their experi- 
ence, Hovland and Hunt (1960) suggested the inclusion of some additional 
capabilities in the IPL-V system to simplify programming of concept attain- 
ment problems. 

‘An area of concern in studies of concept attainment has been the search 
strategy (conservative focusing or gambling focusing) employed by the 
subject. Wickelgren (1962) wrote an IPL-V concept attainment program 
employing the “conservative focusing” strategy. Rather than assuming 
that the subject has complete memory of all information, as was the case 
in the program proposed by Hovland and Hunt (1960), Wickelgren as- 
sumed that the subject forgets blocks of data. The article by Wickelgren 
suffered from lack of detail concerning the actual implementation of the 
process. Since, in programs such as these, many of the coding decisions are 
“programmers choice,” they may influence the attainment of concepts far 
more than may the basic strategy employed. For example, if the list of 
stimuli should be searched in a linear fashion, the speed of attainment could 
well be a function of position, not strategy. The book by Hunt (1962) pro- 
vides an excellent overview of both the theoretical and the practical 
implications of treating, concept attainment as an information processing 
problem. For those readers unfamiliar with literature on concept attain- 
ment and its simulation, the book 3s a helpful starting point. 


Simulation of Behavior in Social Groups 


The use of the computer to analyze and simulate the behavior of social 
groups is a new development that should be of great importance to educa- 
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tional research. The analysis of sociometric data is a particularly persistent 
problem and has been since the inception of the sociogram technique. 
especially in those situations in which large samples are employed. 
Coleman (1961) developed a number of related computer programs for 
analyzing sociometric data for a number of different purposes. One program 
employed a scheme for reducing a complex net of sociometric data to the 
prominent cliques. A second program ascertained the role of reference 
subjects upon the sociometric choices a subject makes. A third program 
employed the second to perform a prediction of the role 
jects in determining the direction of 
means of the third program, one coul 
would become oriented more toward academic subjects than toward 
athletics over a period of time. He also suggested that stability and in- 
stability in three-person groups could be simulated with a computer 
program. 
A social-behavior model has been the basis for an IPL-V simulation 
program of two-person interaction (Gullahorn and Gullahorn, 1962). The 
propositions of the model were translated into IPL-V subroutines. 
represented the basic information-pro 
jects. Although both subjects had at th 


of reference sub- 
development of a social system. By 
d ascertain whether a student body 


which 
cessing mechanisms of the two sub- 


cir disposal identical mechanisms, the 
behavior of each subject was unique because of the symbols, the attributes, 


and the attribute values in the lists possessed by each. Whereas Laughery 
and Gregg's (1962) program simulated a particular subject and the EPAM 
program implemented a particular psychological theory, the IPL-V pro- 
gram termed HOMUNCULUS, which was prepared by Gullahorn and Gulla- 
horn (1962), was a blend of these two approaches. Like the EPAM pro- 
gram, HOMUNCULUS rests heavily upon a theoretical basis, and yet it is 
behavioristic in nature as is Laughery and Gregg's program. 


Simulation of Person ality 


In June 1962, a conference on computer technology and personality 
theory was held in Princeton at the Educational Testing Service and Prince- 
ton University. In addition to serving the purpose of accelerating the dis- 
semination of information regarding computer simulation among person- 
ality theorists, the conference was planned to attaia some evaluation of the 
potentialities that computer simulation holds for the field of personality. 
The contributions of 19 of the participants were published in a book edited 
by Tomkins and Messick (1963). Needless to say, the volume has great 


heuristic value. The inclusion of a comprehensive bibliography of approxi- 
mately 125 references is also noteworthy, 


Distinction Between Simulation and Synthesis 


Feldman (1962) distinguished betw. 


een simulation and synthesis. k 
simulation, the computer program uses the same mechanisms as the origina 
510 
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species to perform the task. In synthesis, for a given set of inputs the com- 
puter yields outputs identical to those of the species, although it does not 
use the same mechanisms. The point at which synthesis ceases and simula- 
tion begins should generate considerable discussion during the next few 


years. 


Pattern Recognition and Generation 
3 

The scholarly discussion by Uhr (1963) of the major developments in 
pattern recognition devices and programs and of their implications for 
psychological theory cannot be summarized without injustice to the author. 
Uhr's paper is required reading for those interested in the field of pattern 
recognition; its 95 references are an excellent bibliography. In a related 
article, Selfridge and Neisser (1960) provided a well-illustrated explanation 
of how a pattern recognition program operates. 

In contrast to the use of the computer to recognize patterns, Gre! 
wrote a computer program which (a) generated patterns, (b) displayed 
them on a cathode ray tube, and (c) photographed the patterns. The 
photographs were subsequently used as stimuli in perception experiments 
with human subjects. Computer generation of stimuli provides a means for 
precise control of the characteristics of stimuli, such as form, angle of 
rotation, and degree of degeneration required by certain types of experi- 
mentation. 


en (1961) 


Automated Classroom 


October 1961 symposium on the computer-based 
teaching machine were published in a book edited by Coulson (1962). At 
that time, the current state of the art of computer-based teaching machines 
was presented; since then, no significant developments have occurred. The 
Computer-Based Laboratory for Automatic School Systems, known as 
CLASS, was constructed by System Development Corporation. Àn auto- 
mated classroom was developed that employed a medium-scale computer to 
control the simultaneous presentation of programed material to as many as 
20 pupils and a teacher. The student response stations contained not only 
cathode ray tubes for presenting material. which were. controlled by the 
computer programs, but also buttons by which the subject communicated 
his item response choice to the computer. A system somewhat more flexible 
in design than the CLASS system was developed by the Coordinated Science 
Laboratory of the University of Illinois and called PLATO II. However, it 
has not been developed to so high an operational level as the CLASS system. 

Although a number of other smaller systems were also presented, the two 
y the highest level of development in this field. The miss- 
he computer-based teaching machine developments 
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has been a concurrent educational research program. Tt has Te 
demonstrated that the computer-based teaching machine is more i o» 
as a learning experience than are traditional elementary school workbooks. 
Additional research should seek those situations in which the unique 


J : e E E 
capabilities of the computer-based teaching machine can make maximur 
contribution. 


Simulation in Educational Administration 
$ 


Although computer simulation of systems such as fiscal needs, personncl 
utilization, and determination of plant location is commonplace in industry, 
work pertaining to analogous simulation of educational systems has not 
been published. There are many aspects of educational administration 
which appropriately could be studied via computer simulation: for example, 
the simulations of automobile traffic performed by the National Bureau of 
Standards (Communications of the ACM, 1962) might provide the basis 
for the simulation of pupil flow to be used in the design of school buildings. 


Hopefully, the next few years will produce some results in this interesting 
area. 


Information Storage and Retrieval 


Information storage and retrieval, the so-c 
been an area of continuing effort for a number 
existing information-retrieval systems, 
gramming languages hay 
retrieval, that 10 infor 
brought to the operatior 
in the research or exper 

One of the most inter 
called Baseball. Devise 
present natural text 
for example, How 


Baseball system, the history of baseball 
and these are mani 


alled library problem, has 
of years. In a survey of the 
Grems (1962) reported that 3 pro- 
e been developed especially for use in information 
mation storage and retrieval systems have been 


id aug e 
nal stage, and that 7 systems are considered to b 
imental stages. 


request. 
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Through means of a factor analysis of key words, Borko (1962a) devel- 
oped an automatic classification of a corpus of psychological documents. 
The documents were assigned to categories on the basis of their factor load- 
ings, and the resulting classifications were compared with the classifications 
of the documents affected through use of the system developed by the 
American Psychological Association. Only a moderate degree of agreement 
was found between the two sets of classifications of the documents. 

The research effort concerning the whole area of the library problem 
has been marked by considerable discussion, tentative proposals, and use 
of heuristic schemes, rather than by generally useful operational systems. 
A theory of information retrieval is needed to provide a conceptual frame- 
work from which adequate information storage and retrieval systems can be 
developed—hopefully, in time to cope with the publication explosion. 


Theory Development 


As the noncomputational uses of computers have developed, it has become 
increasingly apparent that the computer is going to have considerable 
impact upon the development of psychological and social theories. Because 
of the small-step processes by which computers function, the theory must 
be specified precisely—not a trivial task. The nebulous theories prevalent 
in the educational world cannot survive the cold realities of programming 
for a digital computer. As a case in point, Uhr (1963) stated that major 
difficulties have been encountered in recent efforts to put the theories of 
Hebb and Hull into a precise form that would permit them to be tested. 
Uhr also indicated that computer programs are on the verge of becoming 
extremely important theoretical tools and that, unless psychologists take an 
active role in program construction, they will be excluded from an important 
development in their own field. This caution can be directed with even 
greater emphasis to education, for which the majority of theoretical deyel- 
opments are the products of persons outside the field. 

As computer programs become increasingly prevalent vehicles for ex- 
pressing psychological theories, an increased burden of documentation 
will be placed upon those publishing accounts of computer programs, At 
the present time, the authors of articles involving IPL-V programs, with a 
few exceptions, have been negligent both in indicating the assumptions 
made by the programmer to implement the problem and in providing a 
detailed account of the course of the simulated experiment. Without such 
documentation, effective ‘evaluation of the theory involved is not possible. 


Developments in Programming Languages 


There has been a line of continuing development in programming lan- 
guages; it has included absolute coding, regional assignment coding, sym- 
bolic.assembly programs, and the so-called machine independent languages. 
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Bromberg (1963) surveyed the existing programming languages and found 
examples of all levels in use at the present time, although the absolute s n 
regional assignment systems, which have been supplanted by sophistica e 
symbolic assembly routines, are presently considered obsolete. The vani 
in turn, are being supplanted by the “higher-level” languages iny olving 
formula translation, such as FORTRAN. After a rather slow beginning, the 
FORTRAN language, described by McCracken (1961), has become the 
standard scientific programming language of the large-scale computers. 
It is also available for a number of small machines. Because of the limita- 
tions of FORTRAN, many programmers prefer such systems as IBM s 
Fortran Assembly Program (FAP) or Control Data's Fortran Symbolic. 
which allows one to combine in a single program the simplicity of the FOR- 
TRAN language and the programming flexibility and power of the symbolic 
assembly form of machine language. 
A considerable amount of material h 


as been published concerning the 
development of an algorithmic computer 


language (ALGOL), which could 
serve as a programming language and as a standard publication language: 
After a number of years of discussion, this language, ALGOL 60, has not 
gained wide usage because of unresolved difficulties in its machine imple 
mentation (Naur, 1963). However, this effort has a number of offshoots. 
The language termed MAD is an example (Galler, 1962). 


For the next few years, at least, FORTRAN probably will continue to be 
the closest approximation to a universal programming language. The pro- 
liferation of manuals on the fundamentals of FORTRAN programming has 
enabled one to learn easily the bare facts of FORTRAN but not the mean? 
for integrating them into a computer program. The book by Cooley anc 
Lohnes (1962) is unique in that the authors presume that the reader 
knows FORTRAN and illustrate in detail how to use these fundamentals 
to develop 


programs for some rather complex statistical analyses. 


Need for Improvement in the Selection and 
Training of Computer Programmers 


The increasing reliance u 


pon computers in all areas of research is creat 
ing a universal demand for persons skilled in the use of these machines 
sufficient numbers of programmers are to be available in the future, the 
skills, traits, and attitudes of an effective programmer must be ascertainec 
Peres and Arnold (1963) made an initial attempt to identify programmer 
behavior. A high degree of tolerance for frustration appears to be a major 
characteristic of skilled progr. 


i ifica- 
| ammers, Further inquiry into the pat 
tion of programming talent and into de 


: a velopment of instruments to pres uf 
success in programming would certainly be an appropriate and significa 
task for educational research. ; 

Several university-level courses, such as those proposed by Baker (1962) 
and Borko (1962b). have been developed to provide present and futur 
EYE 
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behavioral scientists with a working knowledge of existing library pro- 
grams, of programming fundamentals, and of the role of computers in 
research. The book edited by Borko (1962b) on computer applications in 
the behavioral sciences could serve as the focal point for such a course. 
Feigenbaum (1962) proposed a course to develop the understanding and 
skills necessary to use IPL-V in programming simulation problems. Such 
courses as the above represent a sharp break from the traditional mathe- 
| matically oriented programming course. They should do much to imple- 
ment an intelligent integration of computers into educational research. 
Another attempt to increase the supply of computer programmers has 
been to teach a wide range of programming topics to bright high school 
students (Heller, 1962). Although such courses now exist only in a few high 
schools, an expansion of this program could considerably increase the 
supply of skilled programmers. 


Summary 

The period covered by this Review has been marked by rapid develop- 
ments on a number of fronts in computer technology and applications. Many 
of these developments may be expected to be reflected in educational 
research. The increase in the absolute numbers and in the power of the 
computers available to educational researchers has far exceeded fondest 
expectations of a few years ago. The widespread availability of general- 
ized statistical programs has made routine practice of such computational 
procedures as analysis of variance, factor analysis, and item analysis. The 
time and effort previously expended upon the mechanics of calculation or 
| upon special case programming now can be devoted to design and execution 


of research. 
4 The current embryonic interest in educational data processing, as indi- 
| cated by Yasaki (1963), parallels the growth of business data processing 
some years ago. Hopefully, educators will profit from the considerable 
experience that industrial managers have so painfully acquired. Educational 
data processing, once developed, could influence constructively both infor- 
mation processing in educational research and the instructional process 
itself. College and university educators are in a particularly advantageous 
position to help meet the critical demand for programmers—through their 
development of instruments to predict success in programming, ong 
their design of and participation in experiments aimed at improving meth- 
n ods for teaching programming, and through their influence in requiring 
| courses to be a part of the training of social scientists, i 

There ar any areas of com; r icati i 

affect the dodi of ease ee i end em 
l [ : S well as educational. 
psychological theory. The simulation of cognitive behavior, the study of 
small-group interaction, and the analysis of the nature of social systems— 
all these offer educators a fertile field within which unique contributions 


DD] 


| oo 


SERERE z 
REVIEW OF EDUCATIONAL RESEARCH Fol. XXXIII, No. 5 


could be made. Unfortunately, since educators have been remiss, perans 
without a primary commitment to education have been Leda i er 
pioneering research. Present trends in theory construction seem to indi in 
that many future discussions of psychological theory will be center ; 
around the details of a computer program and around the implications o 
these details relative to the performance of the program. To keep abreast 
of this development, educational researchers will have to become increas- 
ingly well informed about a number of different programming systems. 
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matching rule, 547; variety of statistics, 
applied, 548 
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Models in statistical learning, important 
extensions: concept formation in chil- 
dren, 550; reaction time, 551; response 
magnitude, 550; social psychology, 549 

Models in statistical learning, types of: 
incremental, 546; pattern, 544; pattern, 
generalized, 546; strategy, 546 

Morale, opinions, and attitudes of 
teachers: 409 : 

Multidimensional scaling: applications, 
538; individual differences, 538 

Multifactor designs without repeated 
measurements: Latin square designs, 
495; randomized block designs, 493; 
response surface and related designs, 
494 

Multiple comparisons: contributions to, 
485; separation of populations in rela- 
tion to, 495 

Multiple handicaps in relation to: deaf 
and hard of hearing, 56; visually 
handicapped, 43 

Multivariate analysis of variance: 497 

Need and preparation of teachers 1n 

higher education: 39] . 

New and promising areas of research: in 
curriculum, 332; in procedures, 333; in 
teacher personnel, 445 

New and promising procedures of re- 
search in teacher personnel: 446 

New media in teacher education: 374 


Null hypothesis, rejection or acceptance 
of: 467 


Occupational classification: 200 1 

Occupational information, sources of: 20 

Occupational psychology: 199 

Occupations units, career contene 
and vocational guidance in regula 
course work: 210 id 

Organization and administration of guid 
ance services in relation to: spege” 
services, 154; teacher vs. specialist par 
ticipation, 155; timing in introducing 
guidance services, 155; types of coun- 
selor assignments, 153; use of data- 
Processing equipment, 154; use of ra 
tios, 153 

Organization of special education: n 
federal level, 8; at local level, 9; 4 
state level, 8; extension of programs 
and services in relation to, 10; pogre 
gated and integrated programs of in 
struction in relation to, 9 j 
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Orientation and special guidance courses: 
209 


Parametric and nonparametric statistics, 
selected contributions to: books, in 
statistical methodology, 474; contin- 
gency tables, 484; general contributions 
to the theory of statistical inference, 
475; inference concerning authorship, 
479; multiple comparisons, 485; sig- 
nificance tests, 480; theory of estima- 
tion, 477; theory of statistical infer- 
ence, 475 

Parent groups, counseling with: 208 

Perception-diagnosis theory of teaching: 
419 

Personality characteristics: in teacher 
selection, 361 

Personality structure in relation to dy- 
namics of change in curriculum: 311 

Philosophical basis for inference from 
experimental studies: 490 

Philosophical trends in guidance, counsel- 
ing, and personnel services: 146 

Physies: 281 

Power and efficiency: 482 

Preparation and  professionalization of 
specialists: 175 

Preparation of counselors: 172 

Pre-service and in-service education for 
teachers: campus laboratory school, 
role of, 377; elementary teachers, con- 
cerning, 370; in-service programs, 373; 
new media, with reference to, 374; 
needed research in, 378; professional 
laboratory experiences, concerning, 
375; secondary teachers, concerning, 
371 

Pre-service programs: 370 

Problems of design in curriculum re- 
search: 324 

Problems of robustness, power, and effi. 
ciency, miscellaneous contributions to: 
482 : 

Professionalization of counselors in edu- 
cational settings: 174 

Program evaluation: control group stud- 
les, 216; criterion problem in, 218; 
follow-up studies, 215; general discus. 
sions, 214; methodological problems in, 
218; problems of experimental design 
in, 220 

Programed instruction for the mentally 
retarded: arithmetic studies, 133; 
automated curriculum study, 135; com. 


parison of classroom and programed 
learning techniques, 130; confirmation 
procedures, 131; inductive reasoning 
study, 135; intelligence and step-size 
study, 131; language studies in, 126; 
picture program in teaching sight 
vocabulary, 127; sentence program us- 
ing verbal prompts, 129; spelling, 132; 
types of projects in, 126; use of re- 
tarded tutors, 128; word learning, 130 

Psychological forces influencing curricu- 
lum decisions: learner, 268; learning 
outcomes, 272; learning process, 269; 
research design and psychological in- 
fluences, 273; school learning situation, 
270 

Psychophysics: 534 

Published works: of critical significance 
in area of teacher personnel, 443 

Pupil behaviors: products of instructor 
behaviors, 429; pupil achievement and 
learning, 430 


Randomized block design: 493 

Recruitment: age at making decision to 
be teacher, 356; factors in decision 
making, 356 

Recruitment and selection of prospec- 
tive elementary and secondary school 
teachers: factor studies, 364; recruit- 
ment, aspects of, 355; selection, factors 
pertaining to, 358 

Regression analysis: accretion and de- 
letion | procedures, 503;  B-nonlinear 
models, 502; computer analysis, 505; 
miscellaneous topics, 505; regression 
models, 501; x-nonlinear models, 502 

Regression and correlation: correlation 
analysis, 506; regression analysis, 501 

Relational contexts in curriculum re- 
search: 322 


Research designs and psychological in- 
fluences: 273 


Research. needed concerning: conditions 
of employment and service in elemen- 
tary and secondary schools, 388; pre- 
service and in-service teacher education, 
378; teacher certification, supply, and 


demand, 351; understanding teacher 
behavior, 433 

Research procedures, new and promising: 
333 ` 


Research theory, nature of: analysis of 
problems, 235; natural history stage 
236; stages of inquiry, 234 ; 
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Response surface and related designs: 
94 

one of teachers: 385 

Robustness: problems of robustness, 
power, and efficiency, miscellaneous 
contributions to, 482; t-test with the 
Wilcoxon-Mann-Whitney U-test, com- 
parison of, 480 

Roles and functions of pupil personnel 
workers in relation to: health services, 
156; nomenclature of guidance services, 
157; psychological services, 157; röle 
and function of the counselor, 158; 
self-perceptions of counselors, 157 

Roles and role perception in relation to 
dynamics of change in curriculum: 311 


Salaries and other benefits: in elemen- 
tary and secondary education, 386; in 
higher education, 394; for teachers in 
general, 398; for women teachers, 399; 
in relation to additional employment, 
400 

Sample surveys in education, design of: 
development of statistical theory, 524; 
Furno’s study, 528; general references, 
523; limitations of educational appli- 
cations, 528; Project Talent, 527; 
sampling as means of identifying in- 
formation in census, 527; teacher 
Surveys, 527; use of available lists in 
sampling procedures, 526 

Scaling and measurement 
puter models, 539; history of psycho- 
metrics, 534; information theory, 535; 
mathematics and psychology, 540; 
multidimensional sealing, 536; psycho- 
physics, 534; Teviews and integrative 
summaries, 539; signal detection, 535 

Scheme for classifying teacher behaviors: 
423 

School learning situation: automated in- 
struction and teaching machines, 271; 
group interaction, 272; task and method 
variables, 271; teacher, 270 

Science: chemistry, 282; physics, 281 

Secondary school teacher education: 371 

Selection, factors pertaining to: academic 
competence, 359; career commitment, 
361; criteria, 358; Personality charac- 
teristics, 361 

Selection of personnel specialists: 17] 

Selection, preparation, and professionali. 
zation of specialists: 17] 

Signal detection: 535 
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theory: com- 


Significance tests: 
butions to 


miscellaneous contri- 

problems of robustness, 
power, and efficiency, 482; new tests. 
483; parametric versus nonparametric 
tests, 480; robustness of t-test with 
Wilcoxon-Mann-Whitney U-test, 480; 
statistical assumptions in relation to, 
468; statistical inference, use of, for, 
465; statistical significance and sub- 
stantive significance, 467 : 

Social problems in relation to public 
schools: delinquency, 260; dropouts, 
259; economic influences, 262; political 
problems, 261 . 

Social realities in relation to school situa- 
tion: family life, 258; peer culture, 
257; racial and ethnic differences, 254; 
religion, 255; socioeconomic class, 256 

Social status of teachers in relation to: 
educational level, 406; images and 
stercotypes of teachers and teaching, 
407; occupation of father, 407 

Social studies: 287 


Society and culture in guidance, coun- 
seling, and personnel services in rela; 
tion to: college students, 143; agr d 
deprived, 145; school-age youth, 144; 
values, 144 m 

Special classes and related educationa 
Provisions for the emotionally and 
socially handicapped: dynamic tagia 
in therapeutic education, 116; schoo 
consultation and liaison, 117; therapies 
affecting learning, 118 i 

Special education: administration, uu 
background, 5; needed research in, 15: 
organization, 8; supervision, 12 -— 

Special topics in design and analysis S 
experiments; additivity, 496; Ee us 
tions in two-group designs, A975 d. 
Variance analysis, 497; multiple Hes 
parisons and separation of populatio a 
495; multivariate analysis of variance, 
497 . 25; 

Speech handicapped: aphasia in, pral 
articulatory disorders in, 22; cere ion 
palsy in, 27; cleft palate in relati! 
to, 27; incidence of, 20; dengue 
development in, 21; laryngectomy Ha 
27; miscellaneous contributions Es 
study of, 30; research on, € 
31; speech problems related Mrd 
disorders, 29; speech science, 31; $ ch 
tering in, 24; testing and rescar 
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techniques for, 20; voice disorders in, 
23 

Staff organization and utilization: 383 

Statistical hypothesis testing, problenss in: 
data analysis and statistical hypoth 
testing, 469; decision theory and con- 
clusion theory, 467; null hypothe 
rejection or acceptance of, 467; sig- 
nificance tests andestatistical assump- 
tions, 468; statisti ignificance and 
substantive significance, 467; 
and experimentation, °469; 
and psychological theory, 470 

Statistical inference: Bayesian statistical 
theory, contributions of, 462; Fisher, 
contributions of, 458; hypothesis test- 
ing in relation to, 455; likelihood prin- 
ciple in relation to, 464; Neyman and 
Pearson, contributions of, 458; sig- 
nificance tests, use of, 465; theories of, 
in relation to significance tests, 4. 
theory of, general contributions to, 4 
Wald, contributions of, 460 

Statistical learning models: alterations 
of older models, 544; empirical tests, 
547; extensions, 549; other models, 
551; prospectus, 552 

Statistics: experimentation in relation to, 
469; psychological theory in relation to, 
470 


Structures and groupings of disciplines: 
249 

Studies of critical significance in curricu- 
lum planning and development: 330 

Summary of teacher-student variables: 
considered in the literature, 424 

Supervision in special education: com- 
munity and parent relationships in 
relation to, 14; evaluation of instruc- 
tion as problem in, 12; guidance, 
placement, and follow-up, 13 

Supply and demand: econamic factors, 
347; needed research in, 351; remedies 
for the shortage, 350; subject and grade 
level factors, 348; other factors, 349 

Surveys: see sample surveys ja education 


as 


Task and method variables: 271 

Teacher: 270 ~ = 

Teacher and teaching effectiveness: char- 
acteristics, 296; personality, 295 

Teacher as problem solver: model for, 421 

Teacher attitudes and opinions: 409 


Teacher-board relations: legal aspects of, 
404- 


^ 


Teacher certification, supply, and de. 
mand: 343 

Teacher educatión: 369 

Teacher effectiveness: 431 

Teacher examinations: 345 

Teacher load: 384 

"Teacher morale: 387 

Teacher personnel research in higher 
education: need and preparation of 
teachers, 391; new data sources con- 
gerning, 396; ratings and evaluation 
in, 392; salaries and other benefits, 
pertaining to, 394 

Teacher-pupil relations: legal aspects of, 
404 

Teacher status and role expectations: 382 

Teacher-student variables in relation to 
systems approach: 424 

Teaching: analysis of teaching act, 293; 
as system of social action, 419; teacher 
and teaching effectiveness, 295; tech. 
nology of, 301 

Team teaching: 288 

Technology of teaching: 301 

Tenure of teachers: 402 

Test taking and its preparation in rela- 
tion to: instructions, 188; perception, 
189; practice, 188; screening, 189 

Theory for education: defined, 239; in 
educational research, 238; in relation 
to curriculum development, 238; usages 
of the term, 237 

Theory of curriculum planning and de- 
velopment: 334 

Theory of estimation: interval estimation, 
478; point estimation, 477 

Theory of instruction. and teacher be. 
havior: contributions in general, 445; 
information System model, 420; model 
viewing teacher as problem solver, 421; 
perception-diagnosis theory of teaching, 
419; teaching as System of social action 
419 j 

Theory of statistical inference, general 
contributions: AERA.sponsored publi- 
cations, 476; mathematica] statisticians, 
477; psychological literature, | 476: 
three basic contributions, 475 , 

Theory making in education: incomplete 
analogy, 242; models, 240 

Track organization: 289 


Visually handicapped: general works 
concerning, 38; identification and clas. 
sification of, 38; mobility of, 41; 
multiply handicapped in relation to, 
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43; professional preparation of workers 
with, 44; reading and writing by, 40; 
social psychology regarding, 42; tests 
and psychological studies concerning, 
4l 


Vocational development process: career 


556 


development, 197; occupational classi- 
fication, 200; occupational information, 
sources of, 201; occupational psychol- 
ogy, 199: women's career development, 
200 

Vocational guidance in regular course 
work, 210 


